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INTRODUCTION 

l 

by 

1 L. F. Guseman, Jr. 

! 

I 

The organizational meeting for the NASA Fundamental Research Program in 

i 

"Mathematical Pattern Recognition and Image Analysis" (MPRIA) was held at the 

! , 

NASA/Johnson Space Center in August, 1982, At this meeting each of the fifteen 

! • 

principal investigators briefly outlined the goals of their particular proposed 

i 

research efforts. Most of the efforts (those outside NASA) had just been 

I i 

funded (July 16, 1982), and Investigations were just getting underway, 

t 

In order to gain a better understanding of and stimulate discussions 

! 

between the 'individual research efforts, it was decided to conduct two techni- 

I 

cal workshops at Texas A&M University about six months into the program. The 

I 

first workshop was held, January 27-28, 1983 and consisted of Investigators from 

the "Mathematics/Statistics" areas. The second workshop was held February 3-4, 

1 I 

1983 and consisted of investigators from the "Pattern Recognition" areas. 

I ! 

Each of the workshops was conducted in an informal manner. Most of the 

| ! 

time was spent in lively technical discussions about each of the research 

1 i 

efforts. Additional time was spent discussing the availability of data sets. 

I ! 

Dr. R. P. Heydorn announced the availability of a data tape that has been com- 

I i 

piled for use by the research teams. Details concerning the content and format 
of the tape are discussed in the document entitled "Fundamental Research Data 
Base" appearing in the.'Appendix of these proceedings. 

I ; 

Agendas and lists; of participants for the workshops appear in their 

{ ! 

respective* Proceedings. 



j 
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NASA/KPRIA WORKSHOP i-mWSTAT 

Texas A6M University 
January 27-28, 1983 
Room 510, Rudder Tower 


Thursday, January 27 ; 


8:00 - 

8:30 

Coffee and donuts 

8:30 - 

9:00 

Overview: Fundamental Research Program 
R. P. Heydorn, NASA/ JSC 

9:00 - 

10:15 

Estimating Proportions of Materials Using Mixture 
Models 

R. P. Heydorn and R. Basu, NASA/JSC 

10:15 - 

10:30 

Break 

10:30 - 

11:45 

Some 3-D Density Estimates 
David Scott, Rice University 

11:45 - 

1:00 

Lunch 

1:00 - 

2:15 

Random Field Models for Use In Scene Segmentation 
Manouher Naraghl, JPL 

2:15 - 

2:30 

Break 

2:30 - 

3:45 

FUN.STAT and Statistical Image Representations 
Emanuel Parzen, W. B. Smith and H. J. Newton, TAMU 

3:45 - 

4:00 

Break 

4:00 - 

5:15 

A Minimax Approach to Spatial Estimation Using 
Affinity Matrices 
Carl Morris, UT Austin 

Friday, 

January 28 


8:00 - 

8:30 

Coffee and Donuts 

8:30 - 

9:45 

Covariance Hypotheses for LANDSAT Data 
Charles Peters and H. P. Decell, Jr., University of 
Houston 

9:45 - 

10:15 

Break 
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Planning 


I! 


10:15 - 11:30 

A Hypothesis Test for the Rank of a Linear Minimum 
Sufficient Statistic 
Richard Redner, University of Tulsa 


11:30 - 1:00 

Lunch 

I 

1:00 - 2:15 

An Adaptive Technique for Fitting LANDSAT Data 
Larry Schumaker and L. F. Guseman, Jr., TAMU 

1 

1 

» 

2:15 - 4:00 

Other Presentations, Discussion, and Symposium 
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ESTIMATING PROPORTIONS OF MATERIALS USING MIXTURE MODELS 


Richard P. Heydorn and Rekha Basu 
NASA Johnson Space Center 
Houston, Texas 

1.0 INTRODUCTION 

Let F ■ (f^; £e R^} be a family of probability density functions and 
let G be a distribution function on R N . For the given G we define a 
mixture density h as 

h - / f^dCU) (1) 

Since a 1 ! the members of F are used in this definition, it makes sense 
to say tho. F defines a mapping, say £, from the set of all G- 
distributions, say G, to the set of all induced h-densities, say H. 
If F;G ♦ H is one-to-one and onto then we say that H is identifiable . 
This formula Ion is essentially due to Teicher (1). Thus, identlfia- 
bility implies that, for a given mixture density h, a knowledge of the 
family F \ ill allow us to uniquely determine G. This has practical 
implications for estimating the proportion of a material class on the 
ground using remotely sensed observations of that material. To illus- 
trate the point, we offer the following example. 

Suppose we are given spectral measurements, x, of points (pixels) on 
the ground which have been obtained from 3 satellite-multispectral 
scanner system. We inagine that these x's are observations on some 


PRECEDING 
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I 

i : 

random variable X distributed according to density h. Suppose that 
through experimentation we have found that a given material class on 
the ground gives rise to measurements that are normally distrlb :ed as 
N(*;w,o) but that the means w, and variances change from region-to-region 
or from year-to-year. We know that in a given region there is a finite 
number of material classes, and that these classes can be described by 
members of a normal family; however, the means and variances are unknown. 
The mixture model that applies to this case is 

i 

i 

| 

h ; 1 -U/2)(x-m) 2 /(a$) 

>cx) - i Xj e j (2) 

J*! 1 J 

i I 

where in this example G assigns a point probability Xj to the points 

(Wj, Oj), j“l,2,'... ,M. This is an example of a finite mixture model . 

Since the H material classes are associated with the parameters (Mj, Oj), 

J-l,2 M, X< can be considered as the a-priori probability of ob~ 

i J 

serving the j-th class or Xt is the proportion of the j-th class present 
I J 

in the given region. The primary aim is to determine the X. -values 
l 1 J 

but to do that one has to estimate H, pj, o j , j-l,2,...,M. Studies 

within the AgRISTARS program suggest that a multivariate version of the 

model given in equation (2) fits reasonably well to agricultural data, 

c.f.^ Lennington et al. (2) as well as to data from national vegetative 

i 

classes. In those studies maximum likelihood estimation methods were 
used) to estimate the Xj's, the means, and the covariances. The number 
of classes, M, was determined by applying a heuristically derived 

r | 

algorithm. i 

! ! 

There is yet another point to be made about the use of a mixture model 

; i 

for this application. Given a sequence of unlabeled observations 

x l» x 2****» c ^ e proportions ai,X 2 ,...,X m can presumably be determined. 

! ' 

However, since the observations are unlabeled one cannot associate a 

i 
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class name to a given Xj-value. Thus, there Is a labeling problem 
associated with the use of the mixture model. 


Two possible approaches to this problem are suggested. If one can ob- 
tain a random sample of labeled observations, then It would be possible 
to consider approaches aimed at testing the hypothesis. 

"Xj belongs to material class k" for combinations of k and j. Since 
ground enumeration methods are often required to obtain labeled observa- 
tions, at least for foreign applications, this approach may be infeasible. 


Another approach to the problem is to use observations on some auxiliary 
random variable to predict the mean behavior of the x-observations. 

Thus, one might attempt to use the auxiliary variable to predict the 
mean for the class of interest, and then from equation (2) associate the 
lij which is closest to that prediction. If y is the auxiliary random 
variable, then a better approach might be to consider the bivariate 
mixture model 


M 

h(x,y) ■ I Xi fr (x,y) 

j-1 * 

Knowing f^ , one can determine the regression function values 

(X|Y - y) - / xf£ (x|y)dx, j«l,2,...,M 

J J 


For the class of interest, it is often possible to establish the regres- 
sion function, say Ei(x|Y), having historical observations on X and Y 
from areas similar to the one being observed. Thus picking a Xj that 
is associated with the class of interest is done bv matching the regres- 
sion function Ej(X|Y) to one E^(X|Y), j-l,2,...,M. 
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2.0 STUDY OBJECTIVES 


We intend to pursue studies which are aimed at developing an approach 
to proportion estimation based on the notion of a mixture model. 

Specific objectives are to: 

s) select appropriate parametric forms for a mixture model that appears 
to fit observed remotely sensed data, 

b) develop methods for estimating the parameters in these models, 

c) develop methods for labeling proportion determination from the mix- 
ture mode? , 

and as a possible fourth objective 

d) explore methods which use the mixture model estimates as auxiliary 
variable values in some proportion estimation scheme. 

This latter objective admits the possibility that the Xj-determinations 
may be only rough approximations to actual proportion, but are neverthe- 
less useful as part of some other estimation scheme. 

We have begun our studies by working on objective b) using the normal 
model form of equation (2). Our main purpose in mind is to develop 
methods for estimating M, since, in our opinion, least is known about 
estimating this parameter compared to the Aj-values, the means, and 
the covariances. Interestingly, the approach we are pursuing also 
leads naturally to an estimate of the means. 


/ 
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3.0 Szego'a Solution to the Trigonometric Moment Problem — The Case 
of Equal and Known Variances 


Consider a simple version of the mixture model in equation (2) in which 

I 

the variances are all equal to o and a is known. Thus, we have 


h(x) 


I 

m i_ -a/i)(x-4j ) 1 /(o 2 > 

X J v/^ Cj 


Taking the Fourier transform, we havn 

! M ' irniij -(1/2 )uj z c’- 

H(u) ■ Z Xi e e 

! J*1 : 


Thus _ ! 

I u 2 o z /2 M iupj 

D(u) ■ e E(u) - £ Xj e 

i i 3-1 

J I If 

viven the w-values — and letting d^. - D(oO we have a repre- 

i • 0 

sentation of the complex numbers d k as 

, . . . ,n (3) 

Carathecdory proved that given the complex numbers d[ ,d2»»»« ,d n whtra 
d k *0^ for some k, ; there exists an integer H, l<M<n, and constants Xj e* w j 
such: that Xj>0 and Uj*Uj, i* j , and the representation cf equation (3) 
holds and is unique. The problem of determining M, Xj, jij, j**l,...,M 

given the complex numberd d k , k«l,2,...,n is called the trigonometric moment 

* | 

problem. Norice that the uniqueness of the representation is a consequence 
of the identifiability of normal mixtures. 


M i iWj k 
d k - 2 X j(e J ) . 

j-l 


k-1,2 


| 

j 

l 


\ 

I 

1 


l 
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The piont of interest for this study is the proof of the Caratheodory 
Theorem as given in Greuander and Szego (3), since the proof gives a 
method for determining M and Uj, . ,M. To the best of our 

knowledge tie proof is due to Szego. We now sketch some of the ideas 
of the proof which will be of interest co us. 


Given the complex numbers d^ we construct the Herrritian matrix 

• • • d r 



1 d! 

d-i 1 


d 2 

d l 



• d—n • ......1 

where d_^ - 3^; i.e., the complex conjugate of d^. 

For the representation in equation {3) D can be expressed as 


H 

B« I L 

j-1 


trhich can be written as: 

I 



H 

© - r *4 


■ \ 
.-‘"A ' 

-nip /w 

V 3 7 


f, iW./“ nip /u> '\ 

\l t e J e i o) 


(4) 
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I ’ 

Thus ID is a linear combination of M ra- 1 matrices and so its rack 
trill not exceed M, and in fact ie H. Certainly M<n. 


Also from equation (A) we see that any Toeplitz form, u' ‘Ou is of the 


M 




' ©v' - l x . j E u v (e^'V I 2 
j j-l v-o 


Hence, if y are eigenvectors of D and u' tDu is a (say the first) zero 

, t 

eigenvalue then 

I 

0 - E A. | E J (e 1P J /tf °) v j I 2 

J-l J v-o 

i J 

} * 

Since D is a (n+l)x(n+l) matrix of rank less than or equal to n it mu3t 

! 

have at least one zero eigenvalue. Thus, It must be tha; the complex 
polynomial 


M l 

P(ZV - I Z v 


(assuming X.>0, j-l, 2,.. ,M) has roots Z> - e 

! i 

Summarizing the main points we have that: 


i Wj /o 0 


a) The tank of p is the number of components in our normal mixture 
I i 

model. | 


b) The roots of the (complex polynomial P(Z) lead to the means in our 
normal mixture model. 

I 

In passing we remark! that in the study of time series Pisarenko (A) 

1 „ 

applied Szego's approach. Refinements of the approach have also been 
proposed! by Reddy et al. (5). 



I 

! 
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4.0 The Case of Unknown and Unequal Variances 

The above appioach makes U3e of the following fact about Fourier trans- 
forms. If g(*+y) is a translate of g and g has the Fourier transform 
G(w) then the Fourier transform of its translate is C(u») Hence 
the above approach attempts to find the number of translations and 
their amounts; but, to do that, h must be transformed so that only the 
translates are retained in the Fourier transform. 


He pursue this general approach in considering the problem of unknown 
and unequal variances. We shall only sketch the main ideas of our 
approach. 

For convenience, we assume that /" xh(x)dx ■ 0 and define the 
truncation of h to be 


h b (x) - h(x)U(_ b>b )(x) 


where 


°(-b,b)<*> “ 


Since 


1, x e (-b,b) 
0, x t <-b,b) 


°<-b,b)(x) “ ^(-b-Mj.b-^jOc-lij) 

“ U (-b-ii j ,b+p j )(x-Uj) - I’d,-!. ib +ti )(x-Pj) 

“ u (-b-u j ,b+p J )(x-M j ) - U ( _^ tPj) (x-P j -b) 
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M I -<l/2)(xnii)*/(o 2 «) * 

> (x) " ^ X J Jl «o 3 * U (-b-Uj,b-hij)^*’' M J > 


M 

— I X i 


j # -d^Xx-ujWd^) 


J-l 




or If wo lot 


1 -d/2) £ 

r b£ x ^ " 0 2 J (“b-Uj 


bj v ' /2«o 


l -d/2) £ 

J V J 


( x) - ^ “ U 


o 2 , “(-M |t M 4 ) (x - b) 


Wo oco that h(, Is a mixture of the tritnslai.es of r^j and c^j which are 
truncated functions. 

To retain only the "translation Information" we will make use of tne 
Shannon Sampling Theorem. 


THEOREM (Slnnnon) 

Let g bo a function for which g(x) • 0 for x e (— »,-a)0(a,“). Then 

«■ 

C(u) » t C(-n^) 
r — cj 
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He can rewrite thia transform as 


2iuG(u) 


■ =-j “ i2a " 

- (ki * •“) 


-i2au 


.|i r kc 3ln(2au> -kj 

+ I _ £ 2a 0) - k 


! k— 


where 


» 

Cj. - G(^ . 


Lemma 


Let «»> - I Mfc °‘;rr-'k : 


|E Cvl < - , I* kC ki < “ then llm A(U) “ °* 

1 \ uy+4* 


1 I 

The Shannon Sampling Theorem plus the above lemma suggests that 2iuH b (u>) 
contains just the translation information for large values of <o. Indeed 
it! can be shov^i that this la the case. The main result is finally stated 

in* the following theorem. 


Theorem 


! M 4 iu6 fc j 

V(u) - ' E E a t1 e 3 

ll-l t-l J 


I 

where |a t .j| > 0 and 6 tj are real for j-1.2 M, t-1,2,3,4. Let 

mi - t+2*N, i 'and N integers. Then for n>M the matrix 

1 1 


I 



ORIGINAL PACE 13 
of poor? quality 


17 


t 


r 


r 


i: 

r. 




i 

i 

♦ 

i 

t 

i 

* i 


, 'e>wya*» a w » 


V(m©)» v <“l). • • • . V(u a ) 

B - / VCu-i), V(« 0 >, .... V(vi) 


V(u> 0 ) 

i 

has rank AH provided the constants 6 t j are distinct. Moreover, B + (U)' 
is Hermltian of rank AM (where denotes conjugate of and "o" denotes 

transpose of). 

Thus our approach is to build a Hermltian matrix from 2iwH|,(u), as in 
the above theorem, let N get large, and apply Szego method to compute 
M and the means. 

S.O Possible Extensions and Comments 


In the proceeding approach h can be written as the convolution 


M M 

h(x) -EX./ 

J-l 3 - 


-(l/2)(x-y) 2 /(a 2 j) 1 -<l/2)<y-Mj) 2 /fJ 2 


/2*B 2 


where + 0 2 - a^. The first term in the Integrand represents 
the "exponential decay" part of the mixture end the second the transla- 
tion part. Our approach required that we essentially eliminate' the con- 
tribution of the first term and preserve the contribution of the second. 
To do this, we made use of the exponential characteristics of the first 
term and the pure translation part of the second term. Thus, it would 
appear that representations that depend only on these two properties 
could also be approached by the above method. In particular one could 
consider a representation of the form 


h(x) 


H « 1_ -(l/2)fx-y ) 2 /(o 2 j) 

x ii JSS[ * 


where gj(y-Uj) 


-Y(y-yj) 

Ye , y>iij 


dy 




0, otherwise 
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which would give skewed components In the mixture. 

Finally, we recognize that the methods considered are most easily 
executed in one dimension. To handle the multidimensional problem, it 
may be posoible to: 

a) through transformations, develop vector valued random variables 
that have independent components. Then h could be considered as a 
product of marginal distributions. 

b) consider conditional mixtures. That is consider e.g. h (xfy) and 
solve the mixture problem for several fixed values of y. 

c) treat only Che marginal distributions and consider cases where this 
approach provides at least a good estimate of M. For such an 
approach one may be able to consider projections of the measurements 
which would attempt to bring out the true value of M. 
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Segment 

Number 

Ground Truth 
Proportion ( % ) 

Direct Proportion 
Estimate (%) 

] 1544 

26.81 

26.40 

J 1394 

41.48 

39.57 

1650 

13.73 

10.70 

1920 

15.99 

13.88 

1636 

i 50.16 

50.42 

1663 

53.98 

53.42 

1676 

7.06 

0.0 

1566 

! 37.32 

28.32- 

1899 

j 67.51 

59.03 " 

1825 

34.40 

| 

29.43*' 

g. G.T. Prop. 

- 34.84 



Bias 

Variance 


-3.7$ 

3.26 


Relative Bias «= -0.11 
Coefficient of 

Variation - .0.09 


Table 3.; Proportion Estimates of Small Grains 
i Obtained from the Mixture Model 



(a) 
ire 2. 



Ground truth distribution for pure small grains 
pixels from segment 1899. 

CLASSY estimated distribution for small grains 
fusing all pure pixels from segment 1899. 
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Nonparametric Probability Density Estimation 
for Data Analysis in Several Dimensions*^ 


David W. Scott 

Rice University 
Houston, TX 77251 


JL» Introduction 


Our purpose in this paper is to illustrate how nonparametric proba- 
bility density estimates, in particular the corresponding contour 
curves, are a useful adjunct to scatter diagrams when performing a prel- 
iminary examination of a set of random data in several dimensions. For 
a preliminary approach we generally want to perform fairly cimple tasks 
with free-form techniques to uncover structures and features of interest 
iq the data. Such procedures are often graphical and unlike summary 
statistics seldom lead to much compression of the data. Tukey (1977) 
presents a wealth of such procedures. One which well illustrates the 
power and flexibility of these preliminary procedures is the running 
median smoothing algorithm for time series data (with resmoothing of the 
rough and the like). Other graphical techniques for multivariate data 
are presented in Tukey and Tukey (1981). 


For preliminary viewing of one-dimensional data, both scatter 
diagrams and fiequency curves such as histograms are widely and success- 
fully employed to examine clustering, tail behavior, and skewness of 


This research was supported in part by the Army Research Office 
under DAAG-29-82-K-0014 and by HASA/Lockheed under PQ-0200100079. 
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data. For bivariate data* scatter diagrams are in practice widely pre- 
ferred to bivariate frequency curves. Scatter diagrams of three dimen- 
sional data may be realized by viewing a projection of the data on a 
rotating plane represented by the screen on a computer graphics termi- 
nal. For higher dimensions carefully selected projections may also be 
viewed* and sophisticated techniques have been developed, and are evolv- 
ing, for choosing good projections (Friedman and Tukey, 1974). 
Apparently the success of frequency curves in one dimension has not 
readily extended to higher dimensions. It is an open question as to the 
number of dimensions that may be successfully visualized with a non- 
parametric density estimator under various conditions (sample size, for 
example). It is our purpose to illustrate the power of preliminary fre- 
quency curves as an adjunct to viewing scatter diagrams. 

!• Bivariate. Data. 

We shall examine a data set which contains information on the 
6tatus of the coronary arteries of 371 men suspected of having heart 
disease, having experienced episodes of severe chest pain. These data 
have been more fully described and analyzed; see Gotto, fit. aJL, (1977) 
and Scott, fit al, (1978). After visual examination of the coronary 
arteries by angiography, 51 men were determined to be free of signifi- 
cant coronary artery disease. It was of interest to compare the levels 
of blood fats, plasma cholesterol and plasma triglyceride concentra- 
tions, between the group of 51 disease-free males and the group of 320 
diseased males. The scatter diagrams of these two data sets arc 
displayed in Figure 1. Patients with elevated levels of cholesterol and 
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triglyceride nre evident among the diseased wales. This observation is 
difficult to evaluate in light of the large difference in sample sires. 
However, it is unlikely thnt a larger sample of 320 disease-free tnaleo 
would result in a Rcntter diagram similar to that of the 320 diseased 
males . 


lo obtain a nonparnmetric density contour plot we computed a 
bivariate product kernel estimate (Epanechnikov, 1969) given by 


1 


n x.-x y. -y 

- srr .?, *<ir ni ir> 

x y i=l x y 


using a quart ic (biweight) kernel 


( 1 ) 


K<,>={f(l- S Vl [ . Jil3 0> (2 > 

and preliminary values of the smoothing parameters given by 
h^ = 2 s^ n where s^ represents a trimmed and pooled estimate ot the 
standard deviation for the two groups with a similar expression for h^. 
Density values were computed over a grid of 150 by 90 points. When 
applied to the data for the diseased males* the contour plot reveals a 
striking bimodal feature, a6 shown in Figure 2. The contours of equal 
probability are / the teh levels 0.05 to 0.95 in increments of 0.10 as 
a fraction of the respective maximal modal levels. The density function 
of the disease-free males could be well approximated by a bivariate Nor- 
mal form. Its mode coincides with the left of the two modes in the den- 
sity function of the diseased males. 


The contour plot., have helped emphasise a ! future in the Matter 
diagram that might have gone unnoticed. The contour plots also aid in 
compensating foi the difference in sample sixes. The discovery of the 
bimodal feature led to formulation of a complex cholesterol-triglyceride 



Triglyceride (mg/%) Triglyceride (mg/%) 
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interaction in the model ror estimating the risk of coronary artery 
disease. Clinically, the difference of 50 mg/% between the two modes in 
Figure 2 for the diseased males is greater than the reduction in 
cholesterol by dietary intervention (which usually achieves proportional 
reductions in the range of 10 to 15 percent). 

2. Iiivariatg fiata 

The data presented in this section were obtained by proepssing 
four-channel Landsat data measured over North Dakota during the summer 
growing season of 1977 and were furnished by Dick Hcydorn of 
NASA/Houston and Chuck Sorensen of Lockheed/Houston. The sample con- 
tains approximately 21,000 points, each representing a 1.1 acre pixel, 
covering a 5 by 6 nautical mile section. On each pass over an indivi- 
dual pixel by the Landsat satellite, the four channel readings were com- 
bined into a single value that measures the "greenness” of the pixel at 
that time. The greenness of a pixel was plotted as a function of time 
from the five passes during the growing season. Finally, Bndhvar's 
(1982) growth model was fitted to this curve. This model has three 
parameters which are contained in each trivarinte data point. The first 
variable (r) gives the time the "crop" (if any) ripened. The Gecond 
variable (y) measures the approximate time to ripen. And the third 
variable (z) measures the level of "greenness" at the time of ripening. 
Although it is natural to group these data by actual type of ground 
cover for clussii l:r . .on prococlui es, we have not done so here. 


r » 

v i 



It is not possible to present a satisfactory picture of a three- 
dimensional scatter diagram of these data for this article. However, on 
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an AED512 terminal with 512 by 512 resolution, a projection of these 
data onto the screen typically displayed only 4000 points, the rest 
being "hidden" behind displayed points. Yiewed from several different 
angles, various shapes and features in the data were easily perceived. 
Color was used to indicate the level of the variable perpendicular to 
the screen. 

We can present density contours of an estimate f(x,y.z). Consider 
an cquipiobablc contour at level c; that is, consider those points 
(x,y,z) satisfying the equation f(x,y,r) = c. The solution of this 
equation for a smooth density estimate f is a smooth surface (or sur- 
faces) in ft?. This surface may be displayed by intersecting it with a 
series of planes displaced equal distances along the co-ordinate axes, 
in the following, along only the x and y axes. In Figure 3, we display 
the surface for c = 12 of the maximal mode value. Comparing Figure 3 to 
the corresponding scatter diagram on the same projection plane revealo 
how surprisingly little of the data space is enclosed in this contour. 
In the scatter diagram our eyes focused on rays of points that seemed 
interesting but represented only a small fraction of the data. Also 
notable in Figure 3 is a cylindrical shape disjoint and behind the 
larger surface. This feature was also clearly visible in the scatter 
diagram and represents acres in which sugar beets were grown. 
Apparently the method by which sugar beets are harvested leads to a 
singularity in the estimation of the growth model parameters with y~0. 

Expanding the scale by a factor of 2 while retaining the same 
center as in the c : IX picture, we show the contour shapes at levels 
c = 10Z, 30T* and 3 OX of modal height. Notice how each contour chape 
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"fits" inside the preceding one. Also observe hot* multimodal features 
appear in this space. Three modes are shown in this sequence. On a 
color graphics terminal, we may simultaneously view these end other con- 
tours by using different colors to draw each contour. 

Again, the density plots have complemented and added to our under- 
standing of these data. It is easier to see inside the data cloud with 
this representation and als- makes rotation of the data cloud leBs 
important . 

A. Computational Considerations 

A new algorithm and density estimator were developed to display the 
trivariate contour plots and we hope to report on it in another paper 
(Scott. 1983b). Speed is an important factor in an interactive environ- 
ment. The kernel method used in the bivariate case becomea excruciat- 
ingly Glow when presented with 21,000 points in three dimensions. In 
real time, a few minutes were required on a Vax 11/780 to compute the 
bivariate kernel contours for 320 points on a 150 by 90 mesh. To gen- 
erate the pictures in Figure 3. we evaluated the density on a 30 by 30 
by 30 mesh for 21,000 points. A straightforward kernel estimator would 
have required several hours to computel 

The histogram estimator is extremely efficient computationally, but 
very inefficient statistically — and relatively more inefficient in 
higher dimensions than kernel methods. One recent discovery indicates 
that the frequency polygon may he a good choice of a nonpararaeti ic den- 
sity estimator since it is computationally equivalent to a histogram but 
statistically similai to a kernel estimate (Scott, 1983a). However, the 
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frequency polygon in several dimensions suffers from sensitivity to 
choice of cell boundaries. The new algorithm addresses this problem and 
is asymptotically equivalent to a certain kernel estimate. Other fast 
preliminary estimates in one and two dimensions may be obtained by 
numerical approximation of kernel estimates in place of statistical 
approximation* which we prefer. 

i. Where Da. Hfi. Co ? 

We do not really know for how many dimensions nonparametric density 
estimates will be useful and feasible. Scatter diagrams have been used 
in a highly interactive environment to visualize nine-dimensional data 
(Tukey* Friedman, and Fisherkeller. 1976). Many possible strategies may 
be envisioned for using color and motion to examine data in more than 
three dimensions. We expect much progress in this area. But for larger 
and larger data sets requiring sophisticated analysis, we believe that 
density-based methods will be both efficient and effective. 
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FUN.STAT and 

Statistical Image Representations 

I 

i 

Notes for Presentation by 

i 

j Emanuel Parzen 

institute of. Statistics, Texas A&M University 

i - 

NASA/fiPIRA Workshop (Hath. Stat.) 

| January 27-28, 1983 


Abstract 

! 

1 

Presentation consists of: (1) outline of general ideas of functional 

statistical ! inference analysis of one sample and two samples, univariate 


and bivaria 


univariate probability distributions of multi-spectral image data. 


HUL 


e, and (2) application of ONESAM program to analyze the 


I-SPECTRAL IMAGE DATA ANALYZED DY ONESAM FROGRAM 


Data analyzed consists of 9 files. Each file represents an observation 

; I 

at a different tine in the growing season of a geographical area measuring 

{ 

5 oy 6 naut cal miles divided into segments 117 across and 196 down, for a 
total of 22^32 picture segments. The successive files represent flights on 


days 


PRECEDING PAGE BLANK NOT FILMED 


L 4 iHHftctfitmty f&fflsa 


( 
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128, 145, 146, 163, 182, 199, 200, 235, 236. 

There are 4 measurements per element, representing 4 channels recorded 
by a multi -spectral scanner of sunlight reflection. The channels are 
respectively 4, 5, 6, 7 angstroms, going from visible to infrared to far 
infrared bands of the spectrum. The first two channels may be similar 
(highly related), and the second two channels nay be similar. 

For this area, ground truth data is available in which each element is 
divided into 6 subpixels. The data was originally collected for LACIE 
(Large Area Crop Inventory Experiment). 

There are 36 data sets representing 4 channels in 9 files. Each data 
set consists of 22392 integers (theoretically from 0 to 256) representing 
the reflections from surface elements. The data was received by us from 
Dr. Guseman in the form of 36 historgrams. The histograms were analyzed by 
our ONESAM program to determine the shape of the distribution fitting the 
histogram, and in particular to determine: (1) if the distribution is 

unimodal or bimodal; (2) the variation of medians and interquartile ranges. 
This research aims to contribute to, among other problems, digital 

image representation whose definition we quote. 

Digital image representati on is the determination and modeling of 
basic characteristics or features of the digital image which can be 
incorporated into the process of identifying classes and attributes 
in a scene. Approaches to the modeling of spatial image character- 
istics that require research include quantitative descriptions of 
image texture and the segmentation of images on the basis of spatial 
structure. Research is needed to determine the scene probability 
density functions and class conditional density functions of digital 
image data in order to understand spectral characteristics and extract 
desired information. Determination of density functions will enable 
the development of data transformation which reduce the dimensions of 
multi-variate image data while preserving information pertaining to 
scene classes and attributes. 
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Handout for remarks by Professor Emanuel Parzen 

FUNCTIONAL STAT ISTICAL INFERENCE 
FUN.STAT APPROACH TO DENSITY ESTIMATION 
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1 ' 


1. ONE SAMPLE: UNIVARIATE 

Let X be continuous random variable and Xj, X 2 »* ,,,X r) a ra ' ldotn samp,e 

1 

of X. To estimate distribution function 


f- x (x) = Pr [X_<x3 


and probability densitvl f (x) = F'(x), we estimate quantile funct 


ion 


Q x (u) = F x ^ * 


quantile density q (u) = QjUu), an£l density quantile 


- f x ( ?x (u)) ‘ 


1 F^rm sample distribution function F„ (x) , sample quantile function 

! 

(u) , sainplej quant i 1 e density q(u) at u = j/(n+l), j = l ,2, . . . ,n. 

[ | 

2. pjot sample version of informative quantile function 

I 
I 

= 0 ( 10 - 0 ( 0 . 5 ) , 

,(i(u) 2{Q(0.75) - Q(0.25lT 


whose values as u tends to 0 and 1 indicates the tail exponents 

I 1. . 

of the probability la/ of X. 


; i 


I 


I 

l 
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0F P0 ^'« quau,v 

3. Determine a standard distribution function F o (x) to test 


H : F (x) “ F (-^4 or Q{u) e V + o Q (u) 
o O C u 

for location and scale parameters \i and o to be estimated. 


d(u) = f 0 Q 0 ^ u ) <l( u ) * ° 0 

o » f 1 f Q (t) q< t) dt. 
0 v o o o 


which estimate respectively 


d (u) “ f Q (u) q(u) 1 cj 
o o o 

0 - f 1 f Q (0 q(t) dt. 

o 'O o o 


It. Form successive autoregressive estimators 


d (u) « K 
m m 


• * / t \ 2niu * / » 2n i urn -2 

1 + a \ 1 ) e • • • 4 o m l (n / e 

m m 


whose negentropy 


H « / - log d (u) du «* - log K 

m 'o m i 


m 


\ 


Form 


is used to determine optimal orders m. Note that estimates the 
entropy difference 
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A -{log o Q - log f 0 Q 0 (“)> - {’ fo lo 9 f( * {u)du) 

i i 

! 

5. Estimate fQ(u) by 

I 

hju) = f 0 Q 0 («i °o V u > 


where m is chosen equal to an optimal order m. 


2. TWO SAMPLE: UNIVARIATE 

Let X and Y be continuous random variables with random samples 

x X and y'. Y respectively, and wi th respective distribution functions 

A 1 • ' * ' ,A m l ’ ’ n j 


F(x) = Pr[X<x], • G(x) = Pr[Y<x], 


• • ,, v v 1 v Y can be reqarded as a random sample from 

The pooled sample can ue icya. 

. ! , . i 

the distribution function , 


H(x) *| X F(x) + (1-X) G(x), X = m+n 


To test the hvpc .heses of equality of distributions, 


V F 


x) * G (x) -IhU) , 


it is customary ii '-oi -parametric statistics to introduce 
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D„(u) “ F tl * (u) , Dy(u) ■ G H * (u) 


with densities 


X h H (u) 


<U«.) - 3-iL^i. 

' h H ' (u) 


Note that h H"'(u) =* X f H _, (u) + (i- ) g H - (u) ; therefore 


d x (u) 


1 f H 1 Cu) J 


A raw estimator of D ^( u ) ' s 


D x (u) = F H _ 1 (u) 


from which one can form 


p(v) = /' e 2lI,UV d D x (u) 


and autoregressive estimators d x m ( u ) of d x (u) . 

When one observes k variables X^ X^ one estimates 

(for j = l,...,k) the densities of Dj (u) •* F x yj (H (u)). 


1 


t 
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*X,,X 2 *V U 2* " f X,.X 2 (Q X, (u l } • C1 X 2 (u 2 )) 


To estimate fQ we define 


0 X,,X 2 ( V”2> ■ F X,,X 2 <<! X | (0 I K 0 X 2 (u 2» 


V.X, <“r u 2 > *rH— 0 <V“ 2 > 


I • u 2 


satisfying 


f \,X 2 (u l' u 2 ) = fQ X I (u l ) f % (u 2 } d X,,X 2 (u l* U 2 ) - 
To estimate d y from a random sample (X,^,X_^), j = l form 

A . * A 0 I L 


D X | ,X 2 " F X,,X 2 (Q X t Cu l 5 ’ Q X 2 (u 2 J) 


3 . ONE SAMPLE: BIVARIAT E 

Let (Xj, X 2 ) be jointly continuous random variables with distribution 


function 


F x x (X,, X 2 ) - Pr[X,<x , X 2 <x 2 ] 


and density f^ ^ (xj , x 2 ) . The joint density o.uantile function is defined by 


which is the distribution function c.' U| * F^. (Xj), U 2 * (X^) ; it has 

density 


f 
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I ' m 

and a raw estimator d x (u ,u ). We smooth log <* x x , (u l by 3 snooth 

* | 9 a 2 ■ c 12 


i 

estimator log d x x (a,^) minimizing a criterion slwljar to 


n 

l 

j-« 


| log dtU (j) ,U 2 (j) ] - log d^U, (J) ,U 2 (J) 1 1 2 


where log d (u.,uj has the parametric representation 
m, I l 


\ 

log d m (u r u 2 ) - 0 Vj>V2 exp i (u,v, + V 2 > ' ; 

! : i * 2 


where the summation is over v j» v £ D 


and ^(0 


Vj ,M 


) is an integrating factor to make d m (u, , u 2 ) a probability 

i ; 

density. The foregoing est imators have been implemented in the Ph.D. 
Thesis of tJ J. Woodfield. The problem of choosing a best value of the 

j 1 

order it. is approached b^ evaluating the entropy of d^. 

We expect Woodfield to work with us this summer to extend his results 

to estimation of multivariate density quantile functions. 


i,. T WO SAMPLES: BIVARIATE 

Let (xJ,X 2 ) and (Y^) be random vectors with respective distribution 
functions F(X )f X 2 > and G(X,,Y 2 ), and respective random samples 


(X 


, (j) X (j) ) j-l m and (Y (k) ,Y (k) ) . k-1.2 n. 


1 

l 
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Let denote the distribution function of the pooled random sample, 

with marginal distribution functions Hj(x.) and ll 2 (x 2 ). Define 


D 1 ( V U 2' " F(H j 1 < li l ) ' H 2 ! ( u ^ * 

d 2 ( u 1 »u 2 ) - G(H l 1 (uj), 


From D^Uj.u^} and D 2 ^ u l ,lJ 2^ one can form raw estimators dj(uj,u 2 ) and 
d_ (U| • c ) of the densities 


Vv 


U 2> 


d 2 ( “r 


“2> 


f <H" ' (u, ) , nj' <“ 2 ) > 
h|H,’(U|} hjHj'tuj) 

g(H" 1 (u, ) . H 2 ' (u 2 ) ) 

h,«i 1 {u, ) hjHj'lu) 


9 


Therefore 

log dj(uj,u 2 ) - log d 2 (u ) ,u 2 ) 

- log f(Hj , (uj), H 2 l (u 2 )) - Jog g(Hj'(Uj), H 2 '(u 2 >) 

The likelihood ratio f (x ^ »x 2 )/g(Xj ,x 2 ) can be effectively estimated by 
estimating log d|(U|,u ) - log d 2 ( u ^ , u 2 ) . We propose to investigate 
exponential model representations of 


log dj(uj,u 2 ) - log djj(uj) - log d, 2 (u 2 ) 
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where djj(uj) and d j 2 ^ u 2 ^ are l * ,e mnr 9' na ^ densities of dj(uj,u 2 ) whic 
can be estimated by methods of two samples: univariate. 

The final output are contour plots of the classification statistic 

L(Xj,x 2 ) = log f(xj,x 2 ) - log g(xj,x 2 ) . 

A point (x^,x 2 ) is classified in population 1 or 2 by whether L(Xj,x 2 ) 
exceeds a threshold which depends on the prior probabilities and loss 


funct’on. 
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Univariate Data — 2 populations 
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Tab la g 

S'suiation of Exaapla 3,3 
p ■ 1, q » 2 
* 10, n 2 - 40 


Population 

full Data 
r 2 

All ‘Data 
Hocking-Saith 

All Lata 
Kocking-Karx 

• 

p 2 

PH 

HSE 

mm 


Eat. 

Blaa 

KSE 

.4 

.22357 

.13567 

.06176 

.04640 

.06338 

.10519 

.05072 

.5 

.33333 

.10479 

.05731 

.03036 

.06312 

B 

.03412 

.6 

,45000 

.07241 

.04747 


.04407 

m 

.02698 

.7 

.57647 

.03946 

.03602 

.04230 

.03076 , 

.04723 

.02038 

.a 

.71111 

.02414 

.02437 

.03325 

.02425 

.03154 

'.01044 

’ .9 

.85263 

.00487 

.00923 

.02800 

.01027 

.01470 

.00297 
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Simulation with Example 3.4 
p » 2, q - 2 
- 40,. - 10 


Population 

Full Data 
• 

• • 

All Date 
Hocking-Mac* 

a 

""" P V_ 

l 

-- Eat.- - 

31*6 

~ Dati 
MSExlO 3 

Tract 

- Eat.- 
Bias. 

Dot. - 
MSExlO 3 

Tract 

.4 

.32653 

.04663 

.02471 

.01650 

.04479 

.01906 

.01483 

.3 

.44444 

.04347 

.02592 

.01564 

.04116 

.02160 

.01415 

.6 

.56250 

.03463 

.01757 

.01216 

.0 3 301 

.01267 

.01046 

.7 

.57820 

.02698 

.01202 

.00846 

.02678 

'.01032 

.00776 

.8 

.79012 

.02811 

.00787 

.00583 

.02691 

.00567 

.00503 

• .9 

• 

.39751 

.02764 

.00196 

.00304 

.02598 

.00157 

. .00270 
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Simulation with Example 3.4 ^ 

p « 2, q - 2 
n^ » 20, • 30 


Population 

?ull 3a cc 

All Data . 
Hocking- Harx 

a 


list. 

Bias 

Det. 

HSP.xiO 1 

Trace 

Cat. 

Bias 

Da:. 

MSExlO* 

Trace 

.4 

.32653 

.10331 

.16013 

.03897 

.08391 

.09417 

.02835 

.5 

.44444 

.07997 

.15378 

.03391 

.06511 

.08815 

.02444 

.6 

.56250 

.06436 

.11410 

.02544 

.05484 

.06114 

.017/9 

.7 

.67820 

.05895 

.08405 

.02067 

.03020 

.04097 

.01373 

.8 

.79012 

.05327 

.04995 

.01428 

.04692 

.02382 

.00988 

.9 

.89751 

.05384 

.01465 

.00963 

.04662 

.00659 

’.00697 
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Multivariate Time Series in Two Dimensions 
and the Classification Problem 


Outline of Presentation 

H..J. Newton 

1 

NASA/HP IRA Workshop 
January 27 - 28 , 1983 
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Introduction and Basic Aims of Research 


Transects and Classification of Model Signatures 


An Analysis of (Variance Approach to Finding Boundaries 
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Incorporating Temporal Correlation 


Inrorpoi at imj the Second Dimension 
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Some Computational Considerations 
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THE UNIVERSITY OF TEXAS AT AUSTIN 

AUSTIN, TEXAS 7871a 

w N83 23069 

A Mininux Approach to Spatial Estimation 
Using Affinity Matrices 

Carl N. Morris 

For presentation at NASA Project Meeting 
Texas A&M, January 27-28, 1983 

"An Empirical bayes Approach to Some Spatial Analysis Problems, 
with Special Attention to Remotely Sensed Satellite Imagery". 

Summary : 

Our problem is to combine estimates made in the plane 
to improve on noisy unbiased estimates. We will want to 
use only a small fraction of points in a giant grid to do this, 
those that are most like a given point. Section 1 below pro- 
vides a helpful component of this process defining an "affinity 
matrix" of values, indicating which points are relevant to 
others. Then Section 2 shows that minimax rules can be based 
on affinity matrices. 

1. Affinity Matrices ; 

Let a^j > 0 be the affinity of i for J, 1 £ i } ^ J <_ k. 

JfyJf 

Ue will assume for now that A = (a 1( j), the "affinity matrix", 
is symmetric and the rcws (and columns^ of A sum to unity 
.Someday we may wish to consider more general notions. In 
NASA applications, a^j might be some diminishing function of 
djLj , 'the distance from the 1 th to the j fch pixel, but there 
could be cases where affinities are greater for more distant 
areas, e.g. crops growing in two valleys may be more similar 
to one another than to those growing on a mountain between them. 
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An affinity matrix is Just a doubly stochastic matrix, 
one representing a reversible Markov chain, so much is known 

of them.' For example, the eigenvalues >_ ... >_ a k 

i , 

satisfy a j = 1 (e = (1, ..., 1)’ is its eigenvector), 

I 

a 2 < 1 (if the chain is irreducible), -1. I don't 

know under what conditions A ^ 0, i.e. £ 0, but we need 

j 

to know. It may be that h >_ 0 if, under anpropriate re-labeling, 

» 

l 1 

the elements a^j diminish as they recede from the diagonal. 

Is A _> 0, for example, if a^j depends only on | i- j | and 

decreases as |i-Ji increases? 

i ' 

Note that we can write 


( 1 . 1 ) 


A = E ai Pi 


with P t } = ee'/k and the {Pj_) a complete (EP^ = I) set of 

I I 

orthogonal projections, each with unit rank. 

t 

i * 

It's interesting to consider what affinities might be 

j | 

assigned to the center of a kxk grid k=5). Squared distances 

i 

from the center are listed below. 
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Fig. 1: Distances from center in a 5x5 grid. 
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This chart suggests nine and 21 point grids might be good. 

(Note: points outside the 5x5 grid all have squared distance 
greater than 8.) For 9 points, the a^j might be: a^ ° h (once), 
1/8 (ll times), and 1/16 times), which is given by 

(1.2) « 2** di «J 2 

The point is that the two dimensional grid in Fig. 1 is collapsed 
into one row of an affinity matrix, with 9 non zero entries 
in the case of (1.2). This means we can ignore the complicated 
spatial structure in Fig. 2 when working out theories of estima- 
tion, at least in the Independence case considered in the next 
section. 


f *12 

P i"-1 

• 

• 

0 

t 

1 p 

V 

0 

0 

p i"+l 

! . . . 


P^jpixel^ 

p i+l 

• • * 


(i) 




. X 




V °i 



V-i 

p i’ 

P i'+1 



» 

• 

0 

« 



Figure 2. Areal problem organized 
into pixels or pixel-groups. 
Responses Y^, true values 0^. 
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2. A minimax rule for spatial analysis. 

We take the simplest case of normal, independent observations 
(given the parameters) with equal (known) variances: 

ind 

Y 1 |e 1 “ N(0i, V), i»l,..., k. 


We expect the ( G ^ > to be correlated in any NASA application, 
but not necessarily the pixel intensity measurements { Y-j^ } (there 
was some question raised about this at NASA last August). In 
practice the variances would have to be estimated, and perhaps 
this means we would actually be working with clusters of pixels. 
We need data to pursue these points, as well as the equality 
of variances and all other assumptions. Until such are obtained, 
we may proceed as follows. 

Let A be any given kxk affinity matrix with non-negative 
eigenvalues = 1 > a 2 _> 0. Define 


( 2 . 1 ) 


Y* » AY, 


each YjL being a weighted average of the {Yj}. Let 


( 2 . 2 ) 


a _ (k-r-2)V 
B " E(Y ± - YJ)* 


with r = tr(A). Then let 

(2.3) e 1 = (l-B)Y 1 + B Y* 


i' 


be the estimate of 0 
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In case A a ee'/k, (2.1)-(2.3) reduce to Stein’s estimator 
1 § __ 

for shrinking all to Y^^ ** Y. More generally, this estimator 
is that for shrinking toward a regression surface E0=Z0, 

Z kxr , if A b Z(Z^Z)” l Z is an affinity matrix (since it needn’t 

I 

I 

be, we can see that the affinity matrix assumption is unnecessary), 

I 

We now prove that the estimator (2.3) is minimax for loss 


(2.H) 


L(e,e) = z(e i -e i )2/v 


in the, frequentist sense with risk R(0) depending on 0. Stein's 

i I 

derivative formula 


(2.5) 


: E e (y i- 0 1> f(Y i ) = v E 0 f/ <V 


will be used where needed. 

i 

The following Lemma will be useful. 

1 

Lemma 1. Let M and T be symmetric matrices, tj_ the i*' n column 

i 1 / 

vectori of T, and Q = Y MY. Then 


(2.7) 


•I ■$— (i t^Y) » [Qtr(T)-2Y'MTY]/Q 2 , 


Theorem 1. The! risk R(0) of (2.3) has unbiased estimate 


( 2 . 8 ) 


R “ k - (k-r-2)B + 4B(P-1) 
with P ! = Y / (I-A), 3 Y/Y / (I-A) 2 Y. 

' ! i 

Thus, with a k the minimum eigenvalue of A, 


I 
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(2.9) R £ k - (k-i*-2)D~ 1 4Ba k always 

<_ k - ( k-r-2 )B if a Jc >0. 

It follows that (2.3) Is minlmax if k > r + 2 when a k ^ 0. 

Proof ; Let Q = E(Yi-yJ) 2 = r"(I-A) 2 Y. The risk of (2.3) 

Is R(0) - E || Y-0-3(I-A)Y || 2 /V 

» k + EB 2 Q/V - 2EE(Y 1 -0 1 ) B (I-A) Y/V 

(i) 

where (I-A)^^ is the 1 th row vector of I-A. Now apply (2.5) 
to the last terra and Chen (2.7) with B a cV/Q, c s k-r-2, 
and remove expectations to get 

R = k + cB - 2B [k-r-2 - 
- k - B [k-r-2 -A (P-1)] 

Note that (2.9) follows because 0 <_ F <_ l - QED. 

Of course, we always have 

(2.10) ft < k - (k-r-6) B 

In (2.8) because > -1 for every affinity matrix. 

We should be able to extend this proof to cover the case 

a 

with B replaced in (2.3) by 

(2.11) B + = mln(B, 1). 

Further work must consider the following; 

1. Let A be written as in (1.1) and write S.^ = Y^P^Y. 
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Then the {S^> are independent non-central chi squares with one 
degree of freedom and expectation 


ie. 


ES ± *= E trfP^Y Y'] 

» V + o'PjlO, 

S ± 1 ? d VX^(l.|frL_). 


Note that B » (k-r-2)V/j(l-a 1 ) 1 '^ These facts may be useful 

x 

in further development of the sampling properties of rules. 

2. The rule will do well if e'tl-A) 2 © is fairly small. 
We will have to see if this is likely. Again, we need real 
data! 


3- Is this nearly an empirical Bayes rule? Por what 
(correlated) prior on the 0 i ? What correlated priors, ones 
like those we might expect in NASA applications, lead to good 
empirical Bayes rules? With a one-dimensional spatial (e.g. time) 
problem, autoregressive priors seem to lead to estimators like Y*. 

4. We need to find out a lot more theoretically about 
affinity matrices and their eigenstructure, and to consider 
which ones would be good for our applications. There will 
be problems near boundaries. Markov chain sources will be a 

goid place to start. Covariance matrices arising in autoregressive 
theory also may be a useful source. 

5 . Note that the rule presented (2.3) is easy to compute. 
Still, we should consider carefully the computational aspects 
of this and any other rules we choose to derive. 
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6. Real applications will involve multivariate Yi, 
say bivariate with 'i ll = greenness, Y^ = brightness. 

i 

I 

I 

! i 

i 

i 

References : i 

Stein (1981), Annals of Statistics , pp. 1135-1151 has 
a result implying Theorem 1 here. 
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Hubert Kostal 

| 

Localized Shrinkace Factors and Minimax Results 

J 

1 

Summary : A condition is derived under which a localized 

j 

shrinkage factor estimator will be minimax. A specific 

! 

localized shrinkage factor estimator is described. The 

! 

nohappl icablllty of the derived condition to some estimators 

i 

i s (unfortunately) shown. In the last section several 

1 : 

comments concerning these results are made. 


j ! ' 

i. fJASA Project': "An Empirical Hayes Approach to Some Spatial Analysis Problem: 

with Special Attention to Remotely Sensed Satellite Imagery". 



110 ORIGINAL PAGS IS 

or PC DR QUALITY 

1 . Mlnimax Results Applicable t< ~ Localized Shrinkage Factor 
Estimators . 

We shall be considering the equal, known variance case 

Y|0 - N k (Q, I), (1) 

for assessing estimators 0 of 0 with respect to SEL, 

L(e,e) = iKOi-Oi) 2 . (2) 

Stein (1981 Annals) shows that for fairly general estimators 


of the form 


0 - Y + g(Y), 


gjR^R^ (see Stein (I 98 I) for exact conditions on g) , 


the following results hold: 


E(Yi + g ± (Y) - Oj.) 2 = 1 + E(g?(Y) + 2Vi g ± ( Y) ) 


and so 


Ej| Y + g(Y) - 0 IP = k + EC lis(Y) [P + 2V. S (Y)) 


Here 


V.g(Y) = EV igl (Y) < 

where gi(Y) is the i th component of g. 

r* 

This result may be applied to estimators of the form 


0 = Y - A [i(Y)]AY 


> 
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where A is a preassigned matrix, XiR^+R^ and aQx(Y)[] is 
the kxk diagonal matrix with diagonal elements X^Y), . 

X k (Y). A jl ( Y ) is the localised shrinkage factor for Yj.. Thu3 

Oi - Yi - X 1 (Y)A i Y (9) 

where A^ i3 the i fc ^ row of A (the i*h column of A will be 
denoted a^ and a^j will represent the ij^ element of A). 

We 3hnll assume that X(Y) is chosen so that the necessary 
expectations exist 


Lemma 1 . 

ECYj. - Xi( Y) AiY - 0i)2 « 

1 + E(X|(Y) (A^) 2 - 2Xi(Y) ail - 2A 1 YV 1 X 1 (Y)) (10) 

proof: Apply Stein's result with 


Sj.(Y) “ 

so g|(Y) “ 

and V i g i (Y) - -X (Y) aj.i 

If Xj^(Y) is of the form 

Xi(Y) - 

where R* is a positive definite 

V 1 X 1 (Y) 


-X 1 (Y)A i Y, 

X|(Y) ( A^Y) 2 
- (A i Y)(V i X 1 (Y)). QED. 

1 

WtiTT ( 11 ) 

(symmetric) matrix, then 
-2X 2 (Y) Y'bJ. 


R 


( 12 ) 
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Theorem 1. For X^(Y) ns in (11) » 
B(Yi - li( Y ) Aj^Y - Oi ) 2 “ 


1 + E(X^(Y) Y ' ( A|A i - 2a 11 B 1 + '« b^A^Y). (13) 


* t ' 

Hence 0 i3 mlnimn:: if 


4 1 1 ' 1 

C 1 - aa^B 1 - t A^A ± - Hbj;Ai >_ 0 

I ! 

(i 3 n.n.d.) for i n l, • ••> k. 

proof: Applying Lemma 1 when (.12) holds yields 

I , . 

E(Y^ - X^YjAjY - 0 t ) 2 = l - E(X 2 (Y) Y'C^Y) 

! L ! 

and so if C A ■* 0 wo have 

! “ ' 

E || Y -A [X ( Y )] AY - 0 |f < k • QKD. 


( 1*0 


2. A Proposed Local ised Sh rin kage Factor Estima tor 

I 

Rewrit ting (b) as 

j 

Si! - (1 - A ± (Y) )Yjl + Xi(Y) A»Y 


(15) 


where A # i* I-A, It 


can be seen that X^(Y) determines 


the degree of shrinkage from Y^ to A*Y. Let 
1 | di 


where d^ 


( 16 ) 


is a positive constant. This choice of X^(Y) allows 


the shrinkage at Yj. to be determined by the Yj's which have 


nonzero Weight in A*Y. 


! 

I 





r. 

r 

[ 
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Lemma 2 . J(a£j ) 2 (Yj-A«Y ) 2 = d^’B^ for 

dlB 1 ■ j[a- J ( I r A {)]'[a« J d r A-)] 

where Ij is the J th row of the identity matrix I. 

In terms of A this is 

djB 1 » [Cl-aii ) 2 + j E i ajJlA^ 

Yn a i j (A l (I j’ I i ) + (I r I i ),A i 
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proof: 


+ (Ij-IiJ'dj-Ii)). 

(17) follows upon writing 


Noting 




a« Ctj-AJ) - U-a U > A l 


(17) 


( 18 ) 


for l“j 


~ a l j ( Ai+ ( 1 j ~ ‘ I i ) ) for i/J 


and expanding (17) yields (18). QED 

The estimator 0 with X i (Y) defined in (16) with 


a *j 


a*. 

i 


(19) 


for -J such that a* is nonzero, where a? is the average of 

ij *0 

the nonzero a* , is a spatially moving version of the 
i J 

James-Stein estimator. 


■ 
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Recall that in order to apply Theorem 1 B 1 must be 
positive definite (required for the expectations in ( 13 ) 
to exist). From (17) it can be seen that if a| k = 0 then 
the k^h row (and column) of will consist entirely of 
zeros, and so would not be positive definite. 


3. Comments . 

For spatial data which exhibits only local continuity, 
localized shrinkage factor estimators can reasonably be 
expected to do better with respect to MSE than estimators 
involving only a single (global) shrinkage factor. A number 
of simulations have shown this to be the case. 

Theorem 1 provides sufficient conditions for showing 
that estimators with localized shrinkage factors are minimax 
but, as shown in Section 2, these are probably too restrictive. 
Can the requirement that B>0 be eased? to B>0? Perhaps a 
different approach is necessary. 


/ 
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i TABLE 1 - Distribution of T-Ratios 

l 


i Segment 1645 - 216 Fields 


1 

Percentiles 

0 - 5% 

5 - 10% 

10 - 90% 

90 - 95% 

95 - 100% 

Number 

18 

14 

163 

9 

12 

Frequency 

0.2% ' 

6.5% 

75.5% 

4.2% 

5.6% 


X 2 - 6.72 


| Segment 163? - 57 fields 


Percent ih 

I 

!S 

0 - 5% 

5 - 10% 

10 - 90% 

90 - 95% 

95 - 100% 

Number ! 

6 i 

t 

1 

44 

4 

2 

frequency 

r 

10.5% - 

1.3% 

77.7% 

***•4 

b 

3.5% 



r = 5.45 
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COVARIANCE HYPOTHESES FOR 
LANDSAT DATA 

by 

Henry P. Decell 
and 

Charles Peters 
Department of Mathematics 
University of Houston 


A8STRACT 

Two covariance hypotheses are considered for LANDSAT data acquired 
by sampling "fields’ 1 , one an autoregressive covariance structure and the 
other the hypothesis of exchangeability. A minimum entropy approximation 
of the first structure by the second is derived and shown to have desireable 
properties for incorporation into a mixture density estimation procedure. 
Results of a rough test of the exchangeability hypothesis are presented. 
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Let X = ( X 1 / • • * /^ n ) be a random d x N matrix having a normal 
dN 

distribution in R . In the application we have in mind the colunns 

i 

Y . of X are the mul tlspectral measurements from the set of pixels in 

■ l 

a field randomly chosen from the set of all fields of a particular size 

1 

N and in a particular crop class. The inducing of the X. designates 

i ^ 

i 

the dictionary ordering by line and column number in the image. Since 

there is a high probability that X^ and X^ +1 come from spatially 

J 1 

adjacent pixels, the columns of X are not epected to be independent. 
We will consider two hypotheses concerning the covariance of the X.'s. 

> ! M 

In each, the process ,{X.}._ . is stationary with unknown mean u and 
covariance function r(h) = cov (X^.X^). 


HI: 


' till!' 

{X^} is first order autoregressive with r(h) = R^A 1 'R ? , 


where R dxd is positive definite and A dyd is symnetric with spectral 


radius les!s than 1. That is. 


i+1 
v, -k 


- y \= B(X. - y) + c. 


where B =j n J Aft" 2 and c^, ••• , are independently normally dis- 

1 i 2 

tributed with mean 0; and variance-covariance matrix (I - B )si. 


H2: 


The r.v's !Xj, ••• , X^ are exchangeable; i.e., the dis- 


tribution ;of XQ is the same as that of X for each N x N permutation 

I j 

matrix Q; In this case, r(0) = + E and r(h) = E for h f 0, 


where \{/ 


and i|i + NE are d x d positive definite symmetric matrices. 


H2 implies a number of things about the distribution of X, some of 


i 



which are listed below. 
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Theorem 1. If the distribution of X is normal and satisfies H2, 


then 


(a) the distribution of XU is the same or that of X for each 

W in the group 0^ = (U | is orthogonal and HJ^ * 0^}, 

where = (1» • • * 

(b) if P Nx (n_i ) satisfies P T P = I N-1 and P T J N « 0, then y = XP 

has columns y. , which are independently distributed as 
1 N _ i N 

N.(0,\|i); furthermore the statistics X = A Z X. and 

d N _ y N 1 = 1 1 

S = Z (X. - X)(X. - X) are independently distributed as 

1=1 ! ^ 

N.(h,F. + ^J>) and W d (N-l,\j>) respectively; (If is a family 

of normal distribution of X containing N df j(0,I), then (X,S) 

is sufficient for / V\y if and only if each member of satisfies 

H2), 


Proof: Both (a) and (b) are easily obtained after writing the 

covariance of X as F = ii> 8 I.. + Z 8 The ^rst assertion is 

d N N 

proved in f 1, and depends on (a) and the fact that (X,S) is a maximal 
invariant of 0^ acting on X as it does. 

2 . Approximating Hi by H2 , 

Suppose the density f(x) of X actually satisfies HI with 

parameter values p, R and A. In this section we will show that , in 
brsf 

a sense, the lust approximation to the distribution of X by one satisfying 
H2 is obtained whrn Z is nearly proportional to 1/N, for large N. 

This is plausible, since the average covariance between pairs of distinct 
columns under the Markov assumption is 0(~), see f 1. The method of 



rn»t»r * 


vw * •' ~ 
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approximation we choose is to minimize the relative entropy Qp poop QUALITY 


H(f, f) = (1og f(x}dx 

where ?(x) is a density function satisfying^ with parameters v (the mean), 
*J» and E . The entropy H(f, f) is weakly related to the L , distance 
by the following inequality, which is the sharpest we have been abltto find. 
The proof is essentially that given by Geman [ ] in proving that H(f , f) 


0 implies that 


ft 


f - * «• 


A b 

Theorem 2 ; Let f and f be arbitrary density functiomon R and let 
e > 0. Then 


jf If - f| dx s < * , . ^gUVO H(f. f). 


Proof: Define g( t ) for c > 0 by g(c) = e "_"iog( ) • Then g is 
positive and strictly decreasing on (0,™). Therefore, for £ - 1 •> £ , 


£ - 1 *- g(f ) I | - 1 • log j 1 


\ Jlf - fl = J (f - f) « 


A 

f > f 


J ( T- 1)f * . I ( T' 1,f 

0 < I ‘ 1 i £ 7 - 1 > C 


J (j - I)f + g(e) J C 7 - 1 - logf 3 f 


0 < 7 - 1 < c 


- 1 > 


i 

J 1 f 

nk 


s f + g(c) ! I t - 1 - log£ l f 



A A 

- 9(c) J (logf) f 


ORIGINAL PAQP [g 
0F p 00 R QUALITV 


*= c + gU)H(f, f). 


j j N 

Lenina 1: If X = (X f ( .. f X N ) d x „ satisfies HI, and ^ , 

I N _ t 


! $ -! 1 (x. - X) T 

! ! j- 1 1 

i | 

then (a) £(X) = y 

i 1 , 1 -1 -2 w Vi, 

(b) co/(X) - A) (I + A) A(I - A w )l JV 

i 1 _2 j. 

(c) £(S) * Nn - SI 7 [(I - A) (I + A) - |<I - A) A(I - A N )3 Jl*" 

| j 

j i 

Under H2, the log likelihood function is 

! i 

log^V) = - log|*| - ^log|Rl - | tr * _1 S 


1 1 tr R _1 (I -v)(X -v) T , 


f ■ 

when R *!* NI . By taking expectations and then differentials with respect 

' j a 

to the parameters, one sees that the maximum of £f(log f) is achieved when 

S 

j 

v f €f(^) 

I 

i j 

i * + r. k cov f (x) + g £f(s) 

j 

t F cov f (x) 5 m \ ir € f (s) 

t 

Combining! these results with Lenina 1* one has the following theorem. 


\ 

i 
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Theorem 3. If f in a dN-variate normal density satisfying HI, then the 

A A 

normal density f satisfying H2 which minimizes H(f, f) is the eortM£ 
for which 


fl 

n 

n 

n 


i 

7 


7 . - 


N 1 „ -1 o -2 M k 

N^r n_ imr n c(i - aj (i * a) - £ a(i - a) o - 
1 i 

1 7 “1 O ”2 M ^ 1 

~- r P. [(I - A) (I ♦ A) - | A(I - A) (I - A N )] P - jj-4-yP 


Notice that ^ + E 3 n and that R = <t« + Nr. 


1 

7 


-1 


-2 


N 


1 

7 


= n [(I - A) (I + A) - A(I - A) (I - A )3 n is always positive 
definite and is effectively independent of N for large N. The corres- 

n A 

ponding maximum value of £^(log f) is 

£ f Oog f) ■ - ^ lo9l*| - | log|R| - Sf . 


For large values of N, this is nearly 


-1 


c f (log V)£ - | log[p| - \ log |(I - A) (I + A)| - ^ . 
Under HI, 

log f (X) ■ - | logjpj - log 1 1 - A 2 | - \ Q{X), 


ii 

I! 

II 

!l 

I! 

n 

n 

n 

ii 


where 


1 _1 .1 T 1 .! 1 
q(x) = tr n"*(I - A) « 7 (Xj -nHXjni) + tr p ? (I - A 2 ) P T (X- N ii)(X-/*V 

N-l 4 ? -1 4 T 

- 2 E tr <« ^A{ I - A j P dx... - n)(X. -v) 
j=l J 


#*i 


TTT 5 


Arnmumi i wim Mm *m^F***nm 


r 


i I 


V0**^*m ll|^H P ! !B f ? ^ i yr f*^ 1 Pf ^ *!. • ' 
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+ l tr R (I - A") (I + A fc ) H (X, -w)(X:-p) 
j=2 1 1 


Hence, 


£ f (log f) « - J log|0| - ^-j-5- log 1 1 - A Z |- tr(I - A Z ) ‘a 


-1 2 


-1 


»2\ / T ■ *2\ 


+ (H - l)tr(I - A 2 ) A - tr (I - A 4 ) (I * A 4 ) 

= - J login] - 54" 1 l°9l 1 - a 2 | - ^ . 

Therefore, for large N, the minimum relative entropy is 

H(f, f)* - iS-y-L log] I - A 2 ] + \ log | (I - A) \l + A)] . 

One might think th«t because the inequality in Theorem 2 is symmetric 

A A A 

with respect to f and f, the f minimizing H(f, f) should also be 

A 

investigated. Unfortunately, H(f, f) does not seem to have a minimum 

A 

for all values of si, A, p. We do not know if H(f, f) can be made smaller 

A 

th^n the minimum value of H(f, f). 

3. A Mixture Density Model for LANDSAT Data . 

Suppose K fields are sampled from a population of fields representing 

in crop classes in proportion (these are not areal proportions; 

1 m <sd«U 

rather, they are the probabilities that a randomly related- field from the 
population will belong to the given classes.) The sizes N^, ..., of 
the sampled fields will vary; however, we may suppose that the population 
is suitably restricted so that each N. is independent of the crop class 

J 

and spectral data from the associated field, except to determine the di- 
mensions of the data matrix corresponding to that field. Let n^, .... n fc 
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1 k 

a/id x , , x be the observed field size8 and data matrices and suppose 

the classification of each field is unknown. Then the log-likelihood 
function for the given observations is 


i in • 

Si = £ log P [N=n.l + Z log Ea 1 f,(x J j N=n.) 

! j=i J j=i 1=1 1 1 J 

i 

' * * 

where f^(x J | N=nj) is the density of X J given that Yi el d j is in the 
1— class. The density f . is of the parametric form associated either 
with HI or H2. We are interested in estimating the parameters (par- 
ticularly the ajs) by maximum likelihood. Although he have almost no 
empirical basis for^ bel levying so, we consider HI more realistic than 

i ■ 

H2; however, we know of no computationally efficient way of maximizing 
the likelihood function 1 under HI (see Fuller [ ] for a discussion 

t 

j | 

of the difficulties involved for even a single clas$). Even with H2, 
the likelihood equations are difficult to solve and the £M algorithm 

i 

has no simple formulation, unless one assumes that for each class the pa- 

i \ 

rameterf £ is a constant times In this case, the likelihood equation 

i ! N 

take orj a simple form and are easy to solve iteratively f ]. If H2 is 

taken seriously, as in the random effects model suggested by Feiveson 

j j 

(see [ ]) there is| no justification for the additional assumption that 
*0 1 1 

E « jg— If, however, one regards HI as realistic and H2 as purely 

j j 

the most general feasible covariance model for purposes of estimating the 

i } 

oils, then the discussion in section 2 shows that the additional assumption 
■ r ! 

**0 f 

E = — * 1 s reasonable for approximating the true densities, at least for 
large f|ield sizes 1 N., ••• ,N.. 


t 


4 . Testing the Hypotheses. 
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Hypotheses and H2 will be subjected to several tests and the results 
discussed in a future report. Wa remark that likelihood ratio test for 
HI and H2 against all normal alternatives would be difficult to im- 
plement because of sampling difficulties, large dimensionality, and the 
aforementioned problems in obtaining MLEs under HI. We have conducted 
an informal test of H2 based on part{b) of Theorem 1. Let the data 
matfices from a sample of k fields be X*, ••• ,X^ having dimensions 
d x Nj,***,d x N^. let be any x (N^ - 1) matrices satis- 

fying the conditions of Theorem 1(b). Let y* * xV, = (yi|' , 'lyj i) 


fying the conditions of Theorem 1(b). Let y* * xV, = (yl|*”|yJ i) 

and let = g (yj) C £ (y^Hy^) 1 Then F^ has a central 

F distribution with d and N. - d - 1 degrees of freedom. Under the 

hypothesis H3 stated in part (a) of Theorem 1, this result is entirely 

independent, of the distribution of X^ , and follows from results of 

David f ] showing that the exact distribution of F^ depends only on the 

right spherical symmetry of Y^. However, H3 is stronger, in general 

than H2, except under normality. Table 1 shows the number of F^s which 

fell in the upper and lower tails of their respective F distributions 

2 

and the associated X statistics for 216 fields from LACIE segment 1645 

and 57 fields from segment 1633. In point of fact, the "fields" represented 

in Table 1 arc those produced by an automatic image segmentation program 

(AMOEBA) and may not be representative of real agricultural fields. The 
2 

X statistics are significant at levels between 10% and 20% so that 


Table 1 provides rather weak disconfirmation of H2. 
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A Hypothesis Test for the Rank of the 
Minimal Linear Sufficient statistic 


Richard A. Redner 
and 

William A. Coberly 


Department of Mathematical Science 
University of Tulsa 


January 28, 1983 
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An Adaptive Technique for Fitting LANDSAT data 

i 

Larry L. Schumaker and Larry F. Guseman, Jr. 
Texas AAM University 

Presented at NASA/MPRIA Workshop: Math/Stat 
i Jan. 27 - 28, 1983 


Abstract 


i 


! In this presentation we discuss some preliminary results on 
an adaptive scheme for segmenting LANDSAT images. The idea of the 
algorithm is to first fit a mixture of normals to the measurements 
in each channel to determine the number of classes represented by 
the data along with the mean values and variances of the measure- 
ments associated with each of these classes. 

| The information from the first stage is then used to adapt- 
ively compute a least-squares fit of a piecewise constant surface to 
the data. The resulting segmentation locates and labels the fields 
in the scene, and immediately yields a geometric estimation of pro- 
portions. Several numerical experiments are discussed along with a 
number; of suggested research questions. 
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INPUT MTA 


I1H22252222224444<<4<44444444 
111112222222222444444444444444 
111111222222222244444444444444 
111111122222222224444444444444 
1111111 12222222222444444444444 

ilii!llii?55 8 ifPMHHM 4 . 

111111111112222222222444444444 

- i'niiAiiiii 


iillllHliii 

. 111111111111111122222222224444 
111111111111111112222222222444 
1111111111111111111222222222244 
1111111111111111111122222222224 


0 WG;WAL pflp" rrv 

* P0 °« QUALITY 


rit'il It fro* ADATT2 » initial UloroACt ■ >40 


111 12222222222224444444444444 
11112222222222224444444444444 

WliliffBBBSHSHHHHia 

!iiiiii!i 5 i} 6 H 555 aKiaaa 

U 1 1 11 11222222222222444444444 
111111111222222222222224444444 

i!illl!lll!il55l!l3iiSiasaa 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 222222222244444 
11111111111111112222222244444 
[1 lillll fll 1111 112222222222444 
111111111111111111122222222224 
11111111111111111122222222224 
111111111111111111111222222222 


rotultt fro* ADAPT2 * initial toloranco > .1# 


1111122222222224444444444444444 


lllltl 

11111 

IJjJJJ 

iiiii 

urn 

mu 

mu 

initi 

mu 

iiini 


222222222244444444444444 
222222222244444444444444 
112222222222444444444444 
12222222222444444444444 
1 1122222222224444444444 
11122222222224444444444 
11111222522222244444444 
11111222222222244444444 
11111112222222222444444 
11111112222222222444444 
11111111122222222224444 
11111111122222222224444 
11111111111222222222244 
11111111111222222222244 
11111111111112222222222 


l 
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11112222222222444444444444444444 

11111222222222244444444444444444 


11111 

11111 

Hill 

Hill 

Hill 

11111 

Mil 

Hill 

mil 

HI!! 


22222222224444444444444444 
222222222444444444444444 
222222222244444444444444 
22222222224444444444444 
2222222222444444444444 
1222222222244444444444 

1 111222222222244444444 
111 H 2222222222444 44 44 
111) 112222222222444444 
1111111222222222244444 
1 11 11 1 1122222222224444 
111111112222222222444 


multi froa ABAPT2 t initial toltrancB • .SO 

11112222222222224444444444444444 

11H2222222222224444444444444444 

11H??22222222224444444444444444 

11112222222222224444444444444444 

11111111222222224444444444444444 

11111111222252224444444444444444 

II 11 1 11222222224444444444441444 
HI 111 11222222224444444444444444 

III HH 1222222222222224444444444 
11111111222222222222222244444444 
111 HI 11222222222222222244444444 
11111111 22222222222222224 4444444 
11111111111111112222222222444444 
11111111111111111122222222224444 
11111111111111111122222222224444 
11111111111111111111222222224444 


D 

H 


Si 


I! 

mi 


it 


.. ! 

it 


fl 

• 4 


rfttulti fron ADAPT2 » Initial tnlaranc* ■ .40 


{ t i 
i ! ; 


1111 2222222222224 44 44 4 44 4444 4444 
11112222222222224444444444444444 
11111122222222224444444444444444 
11111111222222254444444444444444 
11111111222222224444444444444444 
11111111225222224444444444444444 
11111111222222224444444444444444 
lilt 11 11222222224444444444444444 
11111111111122222222224444444444 
11111111111122222222222244444444 
1111111111111 1222222222244444444 
11111 llllt 111 11 12222222244444444 
11111111111111112222222222444444 
1111111111111111112-222222224444 
HHill 111 11 11 11 1122222222224444 
11111111111111111111222222554444 


r ) 


n 

i! 


T 1 
I 


i i 


IMPU? *ATA 


3111133311111111 

2111111111111111 

2231111111111111 

2223111111111111 

2222311111111111 

2222231111111444 

2222223111111111 

2222222311111111 

2222222222311111 

2222222222231111 

msmsm 

2777122222222231 

2222122222222223 


ORIGJN’.M. PA*- iJ 
OF POOR QUALITY 


nuUi fro* ADAPT2 r Initial tolaranc* • .M 

3111133311111111 

2311111111111111 

2231111111111111 

2223111111111111 

2222311111111111 

2222231111111111 

2222223111111111 

2222222311111111 

2222222222222222 

7722222222222222 

2222222222222222 

2222222222222222 

2222222222222222 

2222222222222222 

2777222222222222 

2222222222222222 

raaulit fro* ADAPT2* Initial tnlaranc* • >30 

311 1 133311 1 11 ! 1 1 
23111111U111111 
2231111111111111 
2223111111111111 
2222311111111111 
2222231111111111 
2222223111111111 
2222222311111111 
2222222231111111 
7722222223111111 
2222222222311111 
2222222222231111 
2222222222223111 
2222222222222311 
2777222222222231 
2222222222222223 


ratulU fro* ADAPT2 • Initial tolaianr* * .44 

3111133311111111 

2311111111111111 

2231111111111111 

2223111111111111 

2222311111111111 

2222231111111444 

2222223111111111 

2222222311111111 

Miillii 


nmiinnmn 


162 


! 


INPUT DATA 

12222222222222222222222222222222 
11222222222222222222222222222222 
111 222222222222222244 <4222222222 
{ It 1222222222222224444 4 422222222 
t 1111222222222222442224 4 42222222 
11 111122222222222442222442222222 
-{2222244444422222222 



11111111111222222222222211112222 
11131111111122222222222211112222 
1 133311 1111 112222222222222222222 
llSwmilil 11552222222222222222 
1 1 1 1 11 11 1 1 1 1 1 1 1222222222222222 " 
111111111111111122222222222222 


ORIGINAL FAC" !3 
OF POOR QUALITY 


raaulta fro* ADAPT 2 » initial tolaranc* ■ .7# 

11222222222222222222222222222222 
11222222222222222222222222222222 
11112222222222222233444422222222 
11112222222222222244444422222222 
11112222222222223333224433222222 
1 1 1 122222222^2223333223333222222 

11111111111111112222222222222222 
111111111111111 12222222222222222 


11111111111111112222222222222222 
11111111111111112222222222222222 
11111111111111112222222222222222 
ill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2222222222222222 
111 Hill'll 11 111 112222222222222222 
[111111111111111 12222222222222222 


r*»ult« fro* ADAPT 2 * Initial tolaranca ■ .3# 

'11222222222222222222222222222222 
It 222222222222222222222222 222222 
11112222222222222233443322222222 
11112222222222222244444422222222 
11111122222222223333224422222222 
11111122222222223333223322222222 

ltHHII83383533M«j8888i 
1 1 1 1 1 11 1 1 1 2227222222222222222222 
ill 111 11 1112255222272222222222222 
1111111111 1 '22222222222222222272 
1122111111 1155522222222222222222 
11332211111111112222222222222222 
11332211111111112222222222222222 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2222222222222222 
It lillllll ijl 11 12222222222222*22 

i 
| 

raaulta fro* ADAPT2 » initial tolaranca » .2A 
11222222222222222222222222222221 


11222222222222222222222222222221 

11115252222222522224444222272221 

11112222272222222334444332222221 


imiuiiii 

liMIlllill 

iisiBiimj 



mmm 

22222222222222 

^22222222222222 

225552222222222 


-XT." 


I 

I 


33333333333333333333111111111133 
1333333333333332333331 1111111333 
11333333377333333333331111 113333 
1 1 13333337773333333333311 1333333 
11 1 1 333333733332233333331333333 } 
11113353333333333333333 : 

. 21111111 4333333133333331333 $ 
211111 1144433333 1 3333333339333 $ 

Hiiil«SS«8iB§iSB2SS3i33i 

11111444444444111111114444444444 

11114444444444411111144444444444 



rttulit fro* AMMl » Initial .alarsnca 


33333333333,3333333311 1111111133 
22333333332333333333221111112233 
11333333333333333333331111113333 
11223333337733333333332211333333 

Hli333B3313fi333i3S38H333H 

1111 1133333333332323333333332333 
11111122332333333333333333333333 
1 1 1 1 1 1 11333333333333333333334439 
1 1 1 1 11 1 1 3333333333333333334 43333 
1 1 1 1 1 111333333333333333333353333 
11111111333333333333333344333533 
1 1 11 1 1-33444433333333333333333353 
11111144444433333333334433355333 
11113344444444111111114444444444 
11114444444444331111334444444444 


raawltt fro* APAPT1 t Initial toltranc* 


33333333333333333333111111111133 

33333333333333333333221111112233 

33333333333333333333333333333333 
33333333333333333333333333333333 
33333333333333333333333333333333 
33333333333333333333333333333339 
1 11 1 1 1 11333333333333333333334439 
11111111333333333333233233443335 
11111111333333333333333333335333 
11111111 333333333333333344333535 
11111133444433333333333333353333 
11111144444433333333334453333333 
11113344444444111111114444444444 
1 11 14444444444331 111334444444444 


164 


oriGu^l 
OF POOR Qo/w' 


INPUT SATA 


333333333333333333331 11111111133 
13333333333333333333311111111333 
11333333337333333333331111113333 
1 1 1333333777333333333331 l 1333333 
1 1 1 13333337333333333333313333333 
1 1 1 11333333333333333333333333333 
11 111133333333333333333333333333 
2111111 3333333333300333333333333 

BUlMSSiUJSJSiSSH 

2211111 1143333331333333333353533 
21111111444333331333333333333S33 
1111111 4444433333333333355555555 
111 11 144444443333333333535333533 

imuiuttamiMiHiaHaa 


result* froo ADA*T1 » initial tolerance * I >40 


33333333333333333333111111111133 
223333333333333333332211 1 I 1 12233 
11333333335533333333331 111 113333 
1 1 22333355773333333333221 1 333333 

misraasHfflBrara 

11111133333333333333333333333344 
1 1111122333333333300333333333355 
11111111333333333333333333334455 
11111111333333333333333333445555 
11111111333333333333333333535555 
1 1 1 1 1 1 11333333333333333344555555 
11111133444433333333333355555555 
1 1 1 1 1 144444433333333334435535355 
11113344444444111111114444444444 
11114444444444331111334444444444 


n • initial tolerance • *10 


33333333333333333333111111111133 
22333333333333333333221111112233 
11333333335533333233231111113333 
11223333557733333333332211333333 
11113333333333333323333222333333 
11 11223333333333333333333333333 3 
11111133333333333333333333332344 
111 111223333 3333330033333333J 355 
11111111333333333333333332334455 
11111111333333333333333333445533 
1 1 1 1 1 1 11333333333333233333353555 
1 1 1 1 1 1 11333333333333333344333353 
1 1 1 1 1 1 33444433333333333333335355 
11111144444433333332334453355353 
1 1 1 1334 444 44441 1 111111 4444444444 
11114444444444331111334444444444 


I 

I 


r. 

r 

r 

i: 

i: 

r 


j. 


INPUT DATA 


3111155511111111 

2311111111111111 

2231111111111111 

2223111111111111 

2222311111111111 

2222231111111444 

2222223111111111 

2222222311111111 

2222222231111111 

7722222223111111 

2222222222311111 

2222222222231111 

2222222222223111 

27^7122222222231 

*. 4-d.i- L £. d. 4 t-i.d-d.d- 4 .*.!. 


* 7 


."j 


OR! 

OF POOR QUAUTY 


results froo ADAPT2 » initial tolerance = .90 

| 3111155511111111 
2311111111111111 
2231111111111111 
2223111111111111 
2222311111111111 
2222231111111111 
2222223111111111 
2222222311111111 



«. * 4 . * < 

4 i. dm m. t. — 

Tn'i'io 
't'l'lOTT')' 



results fro* ADAPT2* initial tolerance = .50 
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1 

1 


1 . 


3111155511111111 

2311111111111111 

2231111111111111 

2223111111111111 

2222311111111111 

2222231111111111 

2222223111111111 

2222222311111111 

2222222231111111 

7722222223111111 

2222222222311111 

2222222222231111 



2777222222222231 


. I 


result 

f 


( 

i 


s t'ror. A HAP T 2 t initial 
I 

3111155511111111 
2311111111111111 
2231111111111111 
72231 iiiimiin 

222231 111 1111111 
3222231 1 1 1111444 
22222231 11111111 
2222222311 1 11111 
222221’.' 23 1111111 
7722222223111111 
2222222222311111 



tolerance 


i 


.40 


l 


I- 


■i 

Cl * 

it — < 

cl a 
-d o£ 

r -t O 
O 
o 0 . 


Sfc 


O 

r** 


o 

to 


<r 

o 


C- 


C4 eg rgr i r j rg a rg r i eg r g a 

Ci C4 C4t 4 C4 C4C iC 4C4C4C iC4 
Ci C4C4C4 C4C* C4C4C i% 4C ir4 
C4 C4C IC4C4C4 C4C4C4CtnC4 
C4 C4C iC4C*C4C4C4C4C4~+w - « 
C4 C4C4C4C4 C4C iC4C4C4~t~* 
C4 Pgrgrg ; g P i rgpgpgpg-*— i 
pj rg nr 4 <• «r c gegr jpg— 

IN pg PJ «*■ *r *TC 4 C gpgr 4 ci 

C4 c j »<r «r eg o 1 **' rgpg rgrg 

C4 r4 «r *r rgpg«r^‘rgrgrj rg 
rg eg «r*r r«rg <r ^rpgpgf gpg 
ri eg «r«sr rgpg «r*r pjpj pi rg 
rg rgrg^r^ *r«-rg pgrg rgpg 
rg ne-grg «r rjrgrgrgeirg 

C4 C4 C4 rg C4 Ci C4C4 P g T 4 P J P 4 

rg eg p-jp; r jpjrgrgrgpgr jpg 
rgpg p jpg c j Cg r jpgcjrgr gri 
rg C4 rgrg rgrg r jrgrgrgrjpg 
eg c g rgrg r < rgrin rgpg ncg 
pg cirgrg pgegnrJrjrjrg-* 
eg cgpgr j rgrg rgr jrgr i-*-* 
rg eg rgci pgpg pgpgr j~ 

C4 fgpgpj rgrj 
eg rg rgr jpg eg eg 
eg eg rgcg 

eg egegrgpg — <--4 ^ 

eg eg rg c j — i 
rg eg cg-“«— to 
cgCg^g'“<-g'' H ~^cg-*— h-h—* 
e g ^ ri r g eg ^ 


eg eg eg eg 
rgpg pgpg 
eg eg rgpg 
ejpgegrg 

pgpgr grg 
rgpg eg rg 
eg eg rgr g 
rgripgrg 
pgpg rgrg 
rgr < rgpg 
eg eg rg rg 
rgrgpgeg 
P jpgr jpg 
pjpgrgpg 
pjpg e-jpg 
egpgpipg 
eir.ri— « 

rg— ♦*-« -* 


tOro — « 


0 / 

u 

c 

a> 


qj eg egpgrjegeg^grgrgpg 
u eg rg rg eg eg eg rgpg rg rg 
C eg egejpgegrgejpgrgpg 
ro rg pgpg rgrg rgpgpg rgrg 
*- eg eg rgpg rg f *■ rgpg rg eg 
o eg rgeg rgp 4 pg rgpgrgrg 
— • eg r i pgpg roro pgpgrgpg 
O rg rirgpgtotorgi grgpg 
— > rg pg^ <r pgpg 

pg pg^r vm *r«r rgrg 
—i rg eg *r *r rg rg *r «r eg pg 
'0 rg rg*r*r rgpg «r*rpgpg 
ri ruo*r roro ^ lorgrg 
— > rg rgpxrrirt ^rrorgpg 
rg pgegpg *oro rgr gr jpg 
c r j rjnrg rorocgpj rgpg 
•** pg rgpg n rgpg rgpg 
eg pgegpg eiegpgpg— 

- eg rg eg rg pg rg rgpg — « 
pg rgegrg rgegrirg 
eg eg eg rgpg eg eg pgr j-*-* 

♦— eg pg rg rgpg pgegpg 
u, eg rg rgpg rgpg pgpj 
<C rg pgegpg rgpg cjp i 
*z. eg pg eg eg eg eg *-*-* -h —« 

<X rg r? rgpg rgpg — 

rg riegej rgpg ^ ^ . 

5 pg pjrgrgpjpj*M^*H^^^ £ 

O fV P i ^ ^ ^ «m w-i "ri — ^ O 

*- — 1 -. t- 


rgrgegegrgpg 
rgpg eg rgpgpg 
rgpg rgpgpgpg 
pgpg rgp,rgrg 
rgpg pgpgrgpg 
rgpg rg rgr Jpg 
pgpg P ;rg rgpg 
rgP 4 f jpgpgpg 
pgpg pgpg pgpg 
rgpg C 4 C 4 t' 4 c i 
pgpg egpgpgr^ 
pgpg rgpgpg pg 
rgpg rgpgpgpg 
P Jpg rgr <rgrg 
rgpg rgpgpgpg 
pgpg rgcgpgpg 


V-^ ^ PJ 


C- 

<r 


*M V<l rH •"‘4 ^ ^ »M <J 


pg rg pirip-jpg 
pgpg pgpg pgpg 
pg rgeirgr Jpg 
rgrirgpg pgpg 
pg rgpgpg pgrg 
rg rgpgpgpgeg 
pg rarg^g r j rg 
r-J pgpgpj pgpg 
fgrjhW «r ro 
pg pgr^<* 4TfO 
pg pg <r «r r rg 
pg pg ^ pg pg 
pg P g fO ro r-> 

p g P J rO m r-i 

pg rgrgrgr-)h*i 
pg pgpg eg popo 
pg rjf gpj pgpg 
e-jpg pgpg picj 
rg rgpgpg pgpg 
pi rgpgpg pgrg 
pg rgpgpg p i eg 
p g pg rgpg pgpg 
ci rgpgpg rg rg 
pgpg pgpg pgpg 
pg pgpgpgr g Pg 
pg rgpgpg rgpg 
eg rgpgpg 
pgrg pgrg 

pg CJ*g»^ 
p g rg 


pgp irgpj r jrg 
r-^pg pgr j pgpg 
pgrg rgpg pgpg 
pgpg pgpg rgpg 
rgpg pgrg pgpg 
rgrgpgrg rgpg 
pgpg rgrg pgrg 
r irg rgpg r jp j 
• pro r gpg r^pg 
<rr, Pgrg pgpg 
«T*r pgpg pgrg 

— «r p jpj pgpg 

«r'-*rgrgr*rg 
<rro pgr . r^p j 
nr Jpg' jPmPj 

pjfinnrjn 
pgpg pgrg pgrg 

p jp-j r irgpiPi 
p jpj rgr j rgpg 
pgpg r jpg pgrg 
p.rg pgpg 
p jpg rgr i * 

c*r j — — 


pgr i pjpj 
rgfitjr 
rgcg** g 
pgr grg 
pgr* grg 
c jp « rg 
pg rg rgpg 
pgrg rgpg 
pg pg r i rg 
pg pg Pgpg 
pgpgr jpg 
f g Pg PgPg 
pgrjpgpg 
r^Pir-j 
r jrge / i 
r « rg c .pg 


*^r g 

— — rg 


r;rg » 

pg eg ^ 
firOn*-' 


u 

c 

ro 

t_ 

Ci 

o 


eg 

*— 

Cl 

<1 


VO 


IA 

t -4 


3 

QJ 


*A 

Q. 


rgpg rgr; pgpg pgr; rgpg 
pg rgpgpg rgpg pgpgpgr *pgf g pgpg pjp j 
rg pgpgp j pgpg rg r j rgrg pgpg pgpg pgpg 
PJ Pgpgpgpgc j pgp 4 rgpg pgpg pg pg pgpg 
Pg rgpgpg rgpg pgpg rgpg egp j pgpg •“grg 
eg rgf gpg rgpg egpgpgr ipgrg rgrg egp j 
pg rgrgrgpgp-g rjripgpgejpg rjpgr jp-j 
pj rgpgro rgpg pgpgrgpg rgpi cgpg 
pg pg pgpi *r «r pgpgr ip 4 pgp> pgpg 
pgpg^r «r n rg <r to rgpg rgpg pgpgp Jpg 
rg r.jgr«r ropg pjpgrgpg pgrg pjci 
pg r j rr *r rgpg r Jpg "gpg pgpg pgpg 
eg p j-t v rg rj^nf jpj pgpg r g p; pgpg 
pg pgpgrl «r <c* ro Pgpg pgp i rg pg r jp j 
pj rir j ro rgr ; rg p j rgpg rgpg r ic * 
rg fgpgpg top's rgpg Pjpj pjpj -gr j r jp ; 
eg pgr j pgrg r< egp-- rgpgpgpg 
rg pgpgp j pgpg pip t rgpg rgpg . 

pg rgpgpg rgpg rgr gpgpg pgpj 
eg ev eg r-J egpj eg pg rgpg pgp-. — ^ 
f jpgpgfg pjpgrgpg rin 
pg pgpgr g p jpg pgpg rgpg 
eg pgegpg P»P 4 pgp j — . 
rg r jpgpgpirj pg rg — «— . 

pg pgpj p j 

PJ pgpgpg pgpg *-• 

pg C4C4C4 wHw-1 ■*-* 

r g r j pg p g ^ -4 ^ eg p j • 

p g pj •-* ^ ^ ^ »g h ro ro 



i 


167 


i INPUT PAT A 


i 

) 

I 

i 

I 


3333333333333333323311: 111111133 
13333333333333333333311111111333 
1 1 33333333733333333333 111111 3333 
11 133333377733333333333 U133'333 
11112333337333333333333313333333 
11111333333333333333333333333333 
1 1 1 11 1 33333333333333333333333335 
2111111 3333333333300333333333355 
22111111 333333333333333333333555 
1221111 1 1 33333333333333333355555 
221111111 43333231 333333333555555 
211111 1 144433J331333333335555555 
111 1 1 1 1 44 44433333333333355555555 

II 111 1444 1 4 44J3 3333333355 5555555 
11111444444444111111114444444444 

III 144 4444444441 U 1 1 1 44444444444 


original *3 

OF POO" QUALITY 


result? fro* AHA r Tl t initial tolerance * 1.00 


33333333333337333333111111111133 
223333333.V3333373333221 1 1111 2233 
1133 33333333331111113333 
1 1223333557733333333332211333333 
11113333333333233333333322333333 
1 1 1 172333333 333333332-33333333333 
1 1 1 1 1 1 33333333333333333333333344 
11 1 11 1 27333333333300333333333355 
illll 11 1 133333333333333333334 455 
11 11 11 1 1 3333333333333333334 45555 
11111111333333333333333333555555 
11 1 1 1 1 1 1333333332-333333344555555 
11111133414433333333333355555555 
11 111 14 44 4 4 4333333332-34455555555 
11113344444444111111114444444444 
111144444444443311 11334444444444 


results fr. a AUAfl 1 * initial tolerance = .80 


333333333333333333331 11111111133 
2233333333?333333333221 1111 12233 
1333233335533 <j 33333 nun 33 

1222333557733333333332211333333 
1113333333332333323333322323333 
1 1 12233333333332333333333333333 
1 tm33333333333333J332J3333214 
1111122333333333300333333333355 
111 1 1 1 1323333333333333333234455 
lit 1 1 1 1333333333333333333445555 
lit 11 11332-332333333333333553555 
111 111 1333333333333333344555555 
1 1 1 1 133444-132333333333355555555 
till 1 4 44-144333-33333334455555555 
1 1 133441444441 1 t ! 1 ! 11 4444444444 
1 i 14444 44 m 133 111 133444444 414 4 


168 


INPUT HAT A 


ORIGINAL PAG£ 
OF POOR QUAU 


333333333333333333331 11111111133 
13333333337333:23333311111111333 
11333333337333333333331111113333 
11133333377733332333232111332333 
11113333337333333333333313333333 
11111333333333333333333333333333 
II 1 1 1133333333333333333333333335 
21111 113333332232300333333332255 
22111111333333333333333333333555 
12211111133332333333333333355555 
22111111 H 333 333 1333 3333 3 3555555 
21 111 11 1444333331 333333335555555 
11111114444433333333333355555555 
11111144444443332333333555555555 
11111444444444111111114444444444 
11114444444444411111144444444444 


results froa ADAPT 1 > initial tolerance = .75 


3333333333333333333311 1111111133 
22333333333333333333221 It 11 12233 
11323333335533323333331111113333 
11223333557733333333332211333333 
11 1133333333333333:3332233333333 
11112233333332332333333333333333 
11 1 1 1133333333332333333323333333 
11111122333333333333333333333333 
11111111333332333333333333334455 
ljlit 1 11 332333333333333333445555 
11111111333333333333333332555555 
11111111333333333333333344555555 
11111133444433333333333355555555 
111 11144444433333333334455555555 
1 1 113344444444111111114444444444 
11114444444444331111334444444444 


results froa ADAPT 1 » initial tolerance = .70 


33333333333333333333111111111133 

33333333333333333333221111112233 

33333333323333333333331111113333 

33332333333333332333332211332333 

33233233333332223333333333333333 

33233333333332233333333333233333 

33333333333333333333333333333333 

33332333333333333333333333233323 

11111111333333333333333333334455 

11111111333333333333333333445555 

11111111333333333333333333555555 

mm:: 333323322333333344555555 

1111 r. 33444433333333333355555555 
11111144444433333333334455555555 
11113344444444111111114444444444 
11114444444444231111334444444444 


INPUT fcATfi 

11112222222222444444444444444444 
1 1 11 12"'2222222244444444444444444 
11111122222222224444444441444444 
11111112222222222444444444444444 
11 1 11 1 1 1 22222222224 44 4 4444M4 4 44 

I 1 11111112222222222444 4444444444 
1 11 11 1 111122222222224444444 44444 
11111111111222222222244444444444 
11111111111122222222224444444444 
11111111111112222222222444444444 
11111111111111222222222244444444 
11 11 mil 11 1U 122222222224444444 

I I It 1 1 1 1 1 1 1 1 1 1 1 1222222222244 4444 
11111111111111111222222222244444 
1 1 1 11 1 111 1 1 1 1 11 1 1122222222224444 
11111111111111111112222222222444 


results f r ns AUAPT2 » initial tolerance 


ORfGfMAL Fa G t rs 
0F poor ctwur? 


111122 

111122 

111122 

111122 

nun 

min 

nun 

mm 

mm 

mm 

mm 

mm 

mm 

mm 

min 

mm 


i'n'l'rV ‘,1 » 

n ■»«****» 

4 . •_ 

1 1 222 ^^222 
j j o'lon'T'i *> 

1 1 - v 2 #> 2 ,> 2''2 
1 1 2^222222 
1 1 22222222 
j j '> n '> ■ p > 2' > 22 

i 1 t.m. 4 - 4 . — 4 - - *. 

lmiimi 

1111111111 

mmuii 

mimm 


4444444444444444 
4444444444444444 
44*4444444444444 
4444444444444444 
4444444444444444 
4444444444444444 
4444444444444444 
144444444**444444 
2222224'* 1 1 144444 
2222222244444444 
2222222244444444 
2222222244444444 
2222222222444444 
1122222222224444 
1122222222224444 
1111222222224444 


results froiw ADAPTS > initial tolerance 


11112222222222224444444444444444 
1 1 112222222222224444444444444444 

II mi2 ? 2' > '''’222244444 44444444444 
1 11111U222222224444444444444444 
HUH H2212222244 44444444444 44 4 

III 1 1 1 1 1222222224444444444444414 
11111111222222224444444444444444 
11111111222222224444444444444444 
11111111111122222222224444444444 
11111111 UU22222222222244444444 
111111 11111 H1222222222244444444 
11111111111111112222222244444444 
111 HI 1111 11 11112222222222444444 
1 1 1111 1 11 11 11111 112222222222444 4 
m m 111 111 111 11 122222222224444 
111 111 111 111 111 111 1 1222222224444 


s 


I 


I 


INFUT |*ATA 


111 12:22222222444444444444444444 
11111222222222244414444444444444 
11111122222222224444444444444444 
11111112222222222444444444444444 

II 1111 1 122222222214 44444 44444444 
11111111122222222224444444444444 
11111111112222222222444444444444 
111111 1111 1222222222244444444444 
11111111111122222222224444444444 
11111111111112222222222444444444 
11111111111111222222222244444444 
11111111111111122222222224444444 
11111111U111 11 12222222222444444 
11111111111111111222222222244444 
11111111111111111122222222224444 

III 1 1 1 1 1 1 1 1 1 1 1 1 111 1 2222222222444 
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results fro* ADAPTS t initial tolerance = 


i j 

11112222222222224444444444444444 
1 1112222222222224444444444444444 
11111122222222224444444444444444 
11111111222222224444444444444444 
11111111222222222244444444444444 
11111111222222222222444444444444 
11111111222222222222444444444444 
Ull 1111222222222222224444444444 
111 1111111112222222222444 4444444 

II till 11111 12222222222224444 4444 
11111111111111222222222244444444 
111111 11 1111 HI 122222222444 444 44 
111111 Mill 111 1 12222222222 444 44 4 

III 111 1 1 111 111 1 111 22222222224 444 
1 111 1 1 111 111 111 111 2222222222444 4 
11111111111111111111222222222244 

I | 


' results A[iAPT2 i 'initial tolerance = .10 

i j 

II 112222222222444444444444444444 
1 1 1 111 22222222224444444444444444 
11111122222222224444444444444444 
11111111222222222244444444444444 
11111111222222222244444444444444 
111)1111112222222222441444444444 
1 11 1 1 1 11 112222222222444 444444444 
11111111111122222222224444444444 
11111111111122222222224444444444 
11111111111111222222222241444444 
millt 1 1 111 1 11222222222244444444 
1 111 111 mill 1 112222222222444444 
11111 1 1 1 1 11 1 1 1 1 122222222224444 44 

mmmm uni 122222:22224444 

III 111 111 111 mill 2222222222444 4 
111,111 1 111 1 111 Ijlll 1 1 222222222244 

1 I 



J 



APPENDIX 



This Page Intentionally Left Blank 




r 

r 

r 

i 

i 


ORIGINAL PAGE IS 
OF POOR QUALITY 


FUNDAMENTAL RESEARCH DATA BASE 


At the request of Or. R. P. Heydorn, a fundamental research data base has 
been created on a single 9-track 1600 BPI tape containing ground truth. 

Image, and Badhwar profile feature data for 17 North Dakota, South Dakota, 
and Minnesota agricultural sites. Each site is 5x6 nm in area. Image data 
has been provided for a minimum of four acquisition dates for each site. 

All four images have been registered to one another. A list of the order 
of the files on tape and the dates of acquisition is provided in attachment 1. 


1T83 23072 
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Attachment 2 provides Information on the format of the ground truth tape 
and a table for each year to use in interpreting the information on the 
ground truth tape. Ground truth codes vary depending on the year. Like 
the Landsat image files, ground truth files cover an image 196 pixels wide 
by 117 lines long, but the actual size of the ground truth image is 392 
pixels by 234 lines. The reason for this difference is that there are 
six ground truth subpixels for each Landsat pixel, as illustrated. 



Landsat Pixel Ground Truth Pixel 


i i 

The symbols: Cj, C2» 03,04, C5 and Cs represent the ground truth crop code 
for the various sub-parts of the Landsat pixel. We typically use a plurality 
rule to decide on a single label for a Landsat pixel. 


All files are stored on 


tape in universal format. 


Image files and Badhwar 


profile feature files contain four channels of data, but since three Badhwar 
profile features are provided in the feature files (tn, a, and GjnaxJthe fourth 
channel is always zero.! The format for image and profile files is the same 


and is provided In attachment 3. 

1 
1 
1 


i 


i 


PRECEDING PAGE; BLANK NOT FILMED 

i ! 


i 

1 


1 

1 
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File 

TT 

6-10 

11-15 

16-20 

21-27 

28-32 

33-37 

38-43 

44-48 

49-53 

54-58 

59-63 

64-68 

69-73 

74-78 

79-83 

84-88 

89-94 



Type Segment 
Image l3S0 
1394 
1531 
1537 
1544 
1553 
1566 
1619 
1636 
1650 
1653 
1663 
1676 
1755 
1784 
1825 
1899 
Image 1920 


Year State 
“75 Rn 
78 

77 

78 
78 
78 
78 

77 

78 
78 

78 

77 

79 
79 

78 
78 

77 

78 NO 101 


174 

211 

220 

129 

147 

184 

141 

159 

194* 

122 

140 

158 

194 

203 

211 

133 

169 

196 

140 

158 

175 

154 

190 

207 

191 

209 

218 

154 

155 

191 

138 

156 

174 

165 

184 

211 

147 

166 

184 

169 

196 

223 

196 

206 

223 

140 

157 

175 

136 

199 

209 


238 

22C 

221 

176 221 230 

220 
232 

176 230 

226 

236 
208 
211 

237 
220 
241 
224 
193 

217 236 


/ 



Acquisitions (Julian Date) 

TT5 169 196 204 ZZ2 

NO 120 

MT 112 

122 

MT 104 

MT 122 

MN 115 

NO 122 

NO 135 

NO 156 

136 

NO 121 

SO 120 

SO 120 

SO 133 

MN 133 

NO 122 


File 

Jy 

■pe 

Segment 

File 

type 

Segment 

"W 


r r 

■ tm~ 

"TO 

Profile 

1380 

96 


r 

1394 

114 

/ 


1394 

97 

( 


1531 

115 

V 

1531 

98 



1537 

116 



1537 

99 



1544 

117 



1544 

loo 



1553 

118 



1553 

101 



1566 

119 

j 


1566 

102 



1619 

120 



1619 

103 



1636 

121 

) 


1636 

104 



1650 

122 

/ 


1650 

105 



1653 

123 

i 

1 

1653 

106 


) 

1663 

124 

I 


1663 

107 



1676 

125 

l 


1676 

108 

I 


1755 

126 



1755 

109 



1784 

127 

l 


1784 

110 



1825 

128 



1825 

111 



1899 

129 

I 


1899 

112 



1920 

130 

/ 


1920 


Two end-of-flles 





ATTACHMENT 2 



3.2.1 HEADER RECORD 

The Header Record is the first record on the tape and contains 
3060 bytes (8 bits per byte). The record is zero filled except 
for those bytes listed in the following table. The values 
contained in the listed bytes are all constant except for bytes 
61 through 63. The attached tape format contains identification 
and descriptions for each byte. The description and format of the 
Header Record is contained in [attachment 3. 


Byte 

Value 

Byte 

Value 

Byte 

Value 

61 

Day 

96 

1 

111 

120 

62 

•Month 

97 

-120 

1778 

1 

63 

Year 

100 

2 . 

1786 

1 

81 

-128 

101 

28 

1787 

1 

89 

1 

104 

1 

1788 

-120 

; 90 

l ; 

106 

70 



: 91 

8 

109 

1-- 



153 

1 :L_. 

110 

• 

111 

- 

± r. .* 

il * 


Each video scan line is 504 bytes long; a 2-byte record counter, a 70-byte 
ancillary block, and 392 bytes of ground truth (two of the six subpixels for 
a 196 pixel scan line). It takes three video scan lines to complete one 
scan line of ground truth. See the following page for diagram. 



1*- 
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Each video block will be the same number of bytes in length. If 
this tape contains raw data the FCN sync words associated with 
the video data, if any, will be included, with the video data on 
this tape. If this tape contains processed data, no sync words 
will be present. 

I 

i 

j 

The arrangement of data for each pixel is shown in the following 
diagram. Data for subpixels 1 and 2 for pixel 1 is found in 
bytes 73 and 74 of the first data record. Data record 2 and 3 
contain data for suppixel 3 through 6 in the same format as 

record 1. i 

i i 

I ‘ 

I I 
i 


i | 

j Record Byte 

73 74 75 76 
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A 

3 

B't 

C 

CN 

Qi> 

G 

H 

I/CC 
I/GS' r 
I/F 
I/RE 
K 
hi 
NA 

'o 

p 

PF 
R 
SB 
SF 
SG 
SR 
SU 

© 

SY 

T 

TR 

VW 

© 

* 

>; 


p 

PF 


/ 


APPROVED SVMROL LIST 

n+c % /i?7 ' 




ALFALFA • 

BARLEY 

BEANS 

CORN 

COTTON 

FLAX 

GRASS 

HAY 

IDLE COVER CROP 

IDLE CROPLAND STUBBLE 

I OLE CROPLAND FALLCV! 

IDLE CROPLAND RESIDUE 

MILLET 

•MOUNTAINS 

OATS 
PASTURE 
PROBLEM FIELD 
RYE 

SUGAR BEETS 
SAFFLOWER 
'SUDAN GRASS 
SORGHUM 
SUNFLOWER 
SPRING WHEAT 
SOYBEANS 
TREES 
TRET-;ICALE 
VOLUNTARY WHEAT 
WINTER WHEAT 
WATER 
HOVE SI E Aii 




SO 
101 

i 91 

r 

p 92 

i 111 

j 103- 
i 105 
| 106 
! 252 
251 
254 
j 253 
112 
241 


104 

107 

80 

102 

98 

93 

95 

96 
t 94 
j 100 
! 9? 

103 

109 

no 

j 99 
i 240 
250 


V. . 








/ 




177 


A. .r 

S' Cl' 

V V 










<?• 





115 

140 

165 

. 190 

215 

126 

151 

176 

201 

226 

116 

141 

166 

191 

216 

117 

142 

1 67 

192 

217. 

II 

136 

161 

r 186 

211 

236 

T23 

153 

178 

203 

228 

130 

155 

180, 

205 

230 

131 

| 156 

181 

205 

231 

- 

- 

- 

- 

- 

i 

i 

- 


*• 

- 

i 

| ‘ 

m 

• 

- 


1 ** 

«b 

- 

•• 

137 

; 162 

187 

212 

237 

j 

• 

i 

- 

- 

1 

129 

; 154 

179 

204 

229 

132 

! 157 

183 

207 

« 

232 

127 ’ 

152 

177 

202 

22 7 

123 

14S 

173 

19S j 

223 

118 

143 

168 

193 | 

i 218 

120 ! 

145 

170 

195 i 

220 

t 

121 i 

146 

170 

196 

; 22i 

m j 

144 

169 

194 

219 

125 i 

150 

175 

200 

225 

122 : 

147 

172 

197 

222 

133 i 

| 

153 

183 

208 

| 

233 

134 1 

| 

159 

184 

209 

234 

135 ! 

160 

185 

210 

235 

124 : 

149 

174 

199 

j 224 


m * 



i 


* 
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'TV C?.0? CODE 





Grass (G) 


frfaV, Sudin*t»r »ss . Mi 


Pasture (?) 


Trees (T) 


*Open - to be assioned as needed 


rasture , •, _ 

136 * Mix (PM) IS? 


137 
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1979 Crop fear Keys and Delineation Codes 
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Crop Harvested Crop Abandoned 


Crop Harvested i 
for Silers j 



Alfalfa ' 

buckwheat 

Barley 

Clover _ < 

Corn 

Cotton j I 

Dry Bean 
Dm rum Wheat 

Flax i ! 

nil let . ; i 

Oats ! S 

Peanuts | j 

Potatoes 

Rice i I 

Rye * • | 

Sugar Beets 
Sugar Cane 
Safflower 

Soybeans I • ! 

Sorghum , ; 

Sunflower 1 j 

Spring Wheat! 

Tobacco 
Vegetables ( 

Winter Wheat; 

Small Grains/Strip Fields 

* ! i 


Grasses j 

Other Hay 
Orchard/Vineyards 
Pasture ; 

Trees , # ‘ i 

Water > 5 acres 
Non- Agriculture 
Idle Land/Failow 
Previous Year 
Res iduc/Stubble 
Mixed Crop in Field 
Problem Field 
Non- Inventoried Land 


*0pen--to bejassigned as needed (thtough code 130). Other open codes' include 137 
through 1 39, ‘141 through ; 150, 131 through 200, 23-1 through 254. 


1 


] 

i 
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UNIVERSAL FORMAT TAPE HEADER RECORD FORMAT (3060 Bytes) 

BYTE 

cofrrnrrs 

. description 

1-32 

lacie YiJ)Prj5...U 

Computing system id-ESCBIC 

•33-33 

xcoca 

6- digit unload tape number 

•39-52 

PYYBddkksisstk 

KUIOD (E3CI0XC) 

53-60 

ERTCY;;ss)5«..tf 

Sensor id-EBCOIC 

61-63 


Bate of this tape generation 

61 


Bay of month - Binary 

62 


Month number - Binary 

63 


Year - last 2 digits - Binary 

& 

8 

Bally t?pe serial number - Binary 

65-66 


BRTS nission number - Binary 

1 « ERTS A 

2 « ERTS B 

67-68 


Site - Binary (sample segment number) 
idange 1-5C00 

6? 

00000000 

Line - Binary 

70 

00000000 

Run - Binary 

71-72 


Orbit number of nev data - Binary 

73-80 


Tine of first scan in this job (fcr LACIE 
thi« is the time of the center scan of the 
ERI*i scene containing the semple se^tent 
to the last ten seconds) 

73-7 


Tenths of seconds x 1000 - Binary- 

75 


Seconds - Binary 

76 


Minutes - Binary 

77 


Sours - Binary 

78 


Bay of month - Binary 

79 


Month number - Binary 

60 


Year - last 2 digits - Binary 

1 61-68 

i 

i 

i 


Bands active in this Job, 1 bit per band 
left to right ("S3 to LS3) . Video data 
alvays appears in the order indicated 
here. 1 = active. 

81 

llllOOOO 

Bands 1, 2, 3, k active 

82-88 

0 

Bands 5-6L not applicable to LACIE 

89 

0 

froeessins flag - rav data - Binary 

90 

k 

Kumber of bands in this Job - Binary 


I 

1 

I 
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BYTE 

CONTENTS DESCRIPTION 

, 

91 

a 

Busier of bits in a picture elesent - Bin 


92-93 

i 

Addres; of start of video data gives loco' 


. 


of start of video within scan - Binary 


9*-95 

0 

Address of start of first calibration area 
within the scan - Binary 


96-9T 

196 

Busier of video elements per scan within 
a single land - Binary 


98-99 

0 

Busier of calilration eleaents in the first 
1 calilration area within the scan in a 

single land - 3inary 


100-101 

900 

Physical record size in bytes - Binary 

» 

102 

0 

Punier of bands per physical record of data 

1 


• 

set starting with the second record of the 



' 

data set - Binary 

t 

m 

1 

ir 

•103 

0 

Eualcr of physic:! records per scan per 
brand - Binary. Zero unless the elements 
per band is greater than 3K. 

10li 

1 

Punier of records to cake a complete data 
set - Binary. 


>* 

i 

105-106 

70 

Length of ancillary llock in bytes - Binary 

¥• 

107 

0 

Bata order indicator - Binary 0 * video 

t* 

l 



ordered by bind 

L 

r 

108-10? 

1 

Start pixel number number of the first 
pixel per scan cn this tape referenced to 
the start of the scan - Binary 

l. 

110-111 

196 

Stop pixel number number of the last pixel 




per scan on this tape referenced to the 

t 

i 



start of the scan - Binary 


112-623 


Coefficients end exponents-of-tea to linearly 

^ * 



translate parameter values from up to 6* 

{ 

i , 



bands to engineering units. Two bytes per 

r 



coefficient or exponent vith each pair of 
bytes expressed in signed binary. (MS3 a 
sign bit: 0=+, 1=-. (Remaining 15 bits 

«* 



straight binary). 

V 

t 

112-119 

0 

AO coefficients for bands 1-* 

1 

• » 

120-239 

0 

Sands 5-6* not applicable to LACI? 

K • 

1 

2*0-2*7 

0 

£0 expenents of ten for bands 1-* 

1 

2*8-367 

0 

Bands 5-6* not applicable to LACIE 

i 

368-369 

1 

A1 coefficient for band 1 

\ 

i . 

i 

370-371 

- - 

* 

■ 

At coefficient for band 2 


182 

BYTE CONTENTS 

371-373 1 

37U-375 1 

37^95 0 

1*96-503 0 

JOli-629 0 


«.<ik d&rF 13 


dfecbtptio:: 

♦ ■ 1 

AL coefficient for band 3 
A1 coefficient for band U 


Binds 5-61* not applicable to LACI2 

£1 exponents of ten for bands 1-1* 

Bands 5-61* not applicable to LtCIE vfcere 
for each band Y = Engineering Units, C - 
Parameter Value: I = A *10**2 + C*A 
• 10**2 


52l»-687 


688-751 

752 

753 


To be supplied by JSC £olor code information - one byte per 

band in same order as channel active on 
this tape" indicator - 2 in ary. 0 * no 
color assignment 

0 Scale factor - one byte per band in same 

order as "channel active on this tape" 
indicator - Binary 0 = not active 

q Offset constant - Binary 

1 $ Word site of generating computer. Tnis is 

the smallest quantity in bits that the 
computer can vrite on tape. 


75^-1777 


7511-769 0000050000000600 

770-785 0000060000000700 

786-801 C00007 0000000300 

802-817 ooocoSooooooiioo 

810-1777 0 

1778 1 

1779-1780 0 

1781-1782 0 


1783 0 

178U O 

1785-1786 U 


Shortest and longest vave-length of each 
band - EBCDIC. Eight bytes per limit, l6 
hyt« per band - mill microns 

Band 1 - EBCDIC 

Band 2 - EBCDIC 

Band 3 - EBCDIC 

Band U - EBCDIC 

Bands 5-6U not applicable to LACXF - EBCDIC 

Humber of data sets per physical record - 
Binary 

Address of start of second calibration 
vithin a scan - Binary 

Humber of calibration elements in the, second 
calibration area vithin the scan in a single 
bend - Binary 

Calibration source indicator - Binary 
Till zero. 

t 

Humber of bands iu the first record of the 
data set - Binary 

Total number of elements per scan, per band - 
Binary 


1787-1788 196 
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BtTE 

1789-1730 


1791-1791 


1793 - 291*0 


1793-2086 

2087-2181* 

2087-2091* 

2095-2102 

2103 

2101 * 


2105-2106 

2107-2109 

2110 

.2111 

2112-2120 

2121 

2122 


2123-2131 


ORIGINAL PAX ~ 
coirrcrrs eescri Priori 0F H00R QUALITY 

1 Pixel ski? filter - the quantity to he 

to the number of the last pixel process 
jrield the number of the next pixel to b* 
processed - Binary 
X « Process every pixel 

1 Scan skip fectcr - the quantity to he adA 

to the number of the last scan processed 
yield the nutter of the next scan to he pro- 
cessed - Binary. 

1 c Process every scan 

General information. Information in EE CD If 
generated to satisfy user requirements. 

Contents vill be unique for each user end 
. depend not only on the ser.scr, but also cn 
specifications cf the user 1 for vhem the tape 
Is generated. Bytes Tor vhiete user specific 
so requirements vill contain fill zeros. 

Fill zeros 

General annotation byte assignment for .IRIS 
LAC IE 

-X.XX30DC Peek sharpness — EBCDIC 

■X.XXXXX normalized peak to background ratio - EBCDIC 

Manual registration flag 

0 » Automatic 

1 * Manually assisted 

Zero fill flag - Binary 

0 B The sample segment contains no zero fill 
data 

.1 « Part of the sample sequent contains zero 
fill data 

Orbit number of reference data set - Binary 
(not used = 0 ) 

Zero fill 

Cloud cover - Binary - percent of 10X11 1W 
search area covered by clouds 

Zero fill 

IBIS scene/frame id number for reference data 
set - EBCDIC - ADDDKSCS (see bytes 2123-2131 
for content) 

Zero fill 

T^ag indicatir .5 vhether a reference scene has 
been used for registration - Binary 

0 * hasn't been used 

1 *■ has been used 

ERTS scene-frame id number for nev dnta-EBCDIC 
ADDD1DCC1S 


\ 
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BYTE COriTEirre 

2123 

2124-2126 


2127-2128 

2123-2130 

2131 

2132 ; 

2133 

1 

j 

2134-2145 

i 

I 

2134- 2139 ' 

2134 I 

2135- 2137 | 

2138-2139 1 

2ilj0-2ll»5 

2l40 i 

21U-21U3 | 
2144-2145 1 
214 6-2149 ! 

21 1*6 

2147 

2148 
ai»9 

2150-2155 | 
2157-2170 ! 

2157-2162 i SUI.75ZL 
2163-2164 | 

2165-2167 | HAZ 
.2168-2170 | 
2171-2178 j 

l 

* 

I 

2179-2184 .j 


i«escm pri or: 

A * EP.TS aisiion aucber 

HDD « Day number relative to launch at time 
of observation 

HR c hour rx time of observation 
>M *= nir.ute at tine of observation 
6 “ tens of seconds at tine of observation 
' Zero fill 

■Data quality classification 
j 0 “ acceptable 
J. «* narginel 

| ' Center of sample segment - EBCDIC right 
j Justified and padded vith zeros 

latitude 

i *H" •» North "3" = South 

i 

| Degrees - integral 
| Minutes - integral 
longitude 

"B" « East; "W" = West 

! Degrees - integral 

i 

Minutes - integral 

! Band sync status - Binary - the number of 
i lines for vhich sync could not be cair.tair.ec 
j. during pre-processing by band 

| Band 1 

Band 2 

Band. 3 

Band 4 

Zero fill 

I 

j Bun angle - EBCDIC 
| "SUN EL" - EBCDIC 

I Bun elevation - integral degrees EBCDIC 
j • *AZ H - EBCDIC 

i Sun eziauth - integral degrees - EBCDIC 

1 tTlne and date of last update to controlling 
J Inforcation - E3CDIC - YDDDHiCM 

Zero fill 

i 

l 


I 

t 



BYTE COCTD.TS 

* 2201-2202 

*2203-2204 

*2205-2206 

*2207-2208 
•* % 

: *22*9 «DDD 

*J?2254 X 

*2257 Y3TDDD 

. *2262 X 

•2265 ttEDD 

•2270 X 

*2273 Y7DDD 

*2278 X 

2551-2642 0 

2643-2940 

26^3-2658 


* 261 * 3 - 261*6 
•26U3-26U 
•261*5-261*6 
*261*7-2650 
■261*7-261*8 
*261*9-2650 

: *2651-2651* 
12651-2652 
•^2653-2654 
•2655-2658 
•2655-2656 
•2657-2658 
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JSun angles ere 2 byte binary 
Sun angle for BSZG channels 1-1* 

Sun angle for RSIG channels 5-8 
Sun angle for RSEG channels 9-12 
, Sun angle for I.SZG chennels lj-l6 
1st acquisition date (characters) 

. ' Average soil greenness for 1st acquisition 
(binary number) 

2nd acquisition date or blanks 

Average soil greenness for 2nd acquisition 

■3rd acquisition date or blanks 

Average soil greenness for 3rd acquisition 

Itth acquisition date or blanks 

Average soil greenness lor l*th acquisition 

General annotation byte assignments for 
the cyber at JSC 

General arxotation byte assignments for the 
production film converter 

•Sl*W factors and scaling factors - signed 
Binary. Four bytes per channel, vhere first 
tvo bytes *= bias factor; second tva bytes * 
scaling factor. Each factor has an implied 
decimal point to the left of the least 
significant decimal digit. If MS3 * 1 
the factor is negative; if the MSB * 0 
the factor is positive. 

Channel 1 

Bias factot 

Scaling factor 

Channel 2 

. bias factor 

-Scaling factor 

^Channel 3 

^ Bias factor ~ — 

- Scaling factor 

- Channel 4 
Bias factor 
Scaling factor 
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BYTE 

*265?-2606 

2758 


corrrcrrs 


ORIGINAL P ft SE J8 
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27>9 1 

•2760-2783 
*2784-2789 
2790-2792 0 

2793-279 1 * 0 

2795 1 


2796 0 

2797 1 


2798 0 

2799 2 


2800-2307 


kescbipttc:! 

Bias factor and scaling factors for 
channels 5-16 in the sane fomat as above. 


H thousand scan lines per fraae - Binary 
User ID 
■ Blanks 

Altitude in cetera - Binary 

Ground speed in MET/ SEC - Binary 

Bee*; Type - 3inary 
00000000 = ?.av data 
00000001 = Smoothed data 

Angle of ABC in degrees - Binary 

Cecera - 3inary 
00000000 = 70 MI 
00000001 = 5 inch 

Input device - Binary 
00000000 = 9-track. 

00000001 = high density tape 

Truncation 

0*2 lov order bits 

1 = 2 high order b"ts 

2 *» no truncations 

Channels requested. 1 bit per channel - 
Binary 


■£600-2801 11110000 00000000 (l'acq) 
HU1111 00000000 (2 acq) 
11101111 1111GCC0 (3 ace) 
11111111 nniill ^4 acq) 

2802-2607 0 

2808 0 


2809-2824 0 


Channels 1, 2, 3, 4 requested 


Channels 16-64 not applicable for Unload 

Processing code - Binary 
00000000 = serially 
00000001 = concurrently 

Density for eight saturated colors - tvo 
bytes per saturated color - Binary vhere 
first byte = lov intensity level of the 
range; second oytc = high the range of the 
Intensity level is 0 to 255 


2809-2310 

2611-2812 

2813-2814 

2815-2816 

2817-2818 

• 2819-2820 


Bed density range 
Blue density range 
Green density range 
Magenta density range 
Cyan density range 
Yellov density range 
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BYTE 

2821-2322 

2823-2824 

282? 

282$-2d73 

2874 


2075 


2876 


2877 


2878-2881 

2878-2679 

2880-2881 

2862-2883 

2884 

2885-2386 

2885 

2886 

2887-2940 
• 2941-3000 
3001-3060 


coirrsrrrs 


To Be supplied by JSC 


0 

0 


6 


0 

i 

0 

'0 


iLACIEftlDPr*...# 
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DESCRIPTION 
Waite density range 
Slack density range 

Filn processing flag 

0 ** Process this file 

1 « Skip this file 

Fill zero 

Color select* - Binary 
D * Do color 

1 *> Assigned color 

2 ** False color 

3 “ Saturated color 

Image forr.at* - Binary 

0 E Single image 

1 * Enhanced images 
2*“ Abut images 

3 “ Offset images 

Repeat of pixels per seas - Binary 
0 * None 
1*1 repeat 
2*2 repeats 
S * a repeats 

Repeat of scan - Binary 

0 *= none 

1 * 1 repeat 

2 « 2 repeats 
a * n repeats 

Partial scan ~ Binary 

6tart pixel number 

Stop pixel numVer 

(if bytes 2767-2861 contain Ml zeros, 
full scan is expected - not partial) 

Sensor scan rate in scans/secood - Binary 

Pixel size - Binary 

Angle of drift — Binary 

— integer degrees 

fraction 

Till zeros 

Title - user designated identification 

^"113. zeros, makes the record an integral 
number of computer words. These bytes 
Bust never contain data. 


I 
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3.2.2 DATA SETS 

The data follows the Header Record and is arranged in data sets. 

A data set is defined as the ancillary data and all of the video 
data for one scan line for all active channels. Data sets are 
recorded in variable length physical records containing a 
maximum of 3000 bytes of information per record. Since 3000 bytes 
Is not compatible with the word length of all computers, the 
record includes a sufficient number of fill zero to make the record 
divisible by 32, 36, 48, and 60 bits. However, the maximum 
length of the record may not exceed. 3060 bytes. If two are more 
records are needed for the data set, the data set will be divided. 
Under no condition will the data for a video channel begin in one 
record ‘and continue into another record. 

The first two bytes of each record will contain the number of the 
physical record within the video data set. This is for use in 
data sets that contain more than 3000 bytes and therefore require 
more than one physical record for recording. The ancillary block 
is the first block of a data set and follows the record counter. 

The length of the ancillary block is variable, with the number 
of bytes civeh in the header record. 

Bytes 73 through N will be dependent on whether this job contains 
raw processed data (Byte 89 of the header record) . The value 
of N will be given in bytes 105 and 106 of the header record and 
will always be greater than or equal to 70. 

If this job contains raw data bytes 73 through N will contain the 
housekeeping data channel from the sensor, if one is available. 

Following the ancillary data in each data set will be the video 

data for the one channel for one scan. The video data for the 
channel for cne scan will comprise a video block. 
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UNIVERSAL FORMAT TAPE ANCILLARY BLOCK FORMAT 


byte 

1-68 

69-70 


CONTENTS * DESCRIPTION 

0 Zero fill 

•Relative scan line auaber 


ORIGINAL vau.: ^ 
OF. POOH QUALl i » 


UNIVERSAL FORMAT SCAN LINE FORMAT (900 Bytes) 


ORIGINAL PACE IS 
OF POOR QUALITY 


DATA SET 1 Variable r.um- 


NoleJ The number of bytes for each 
data set will be the same in 
each data run. 


e (DATA RUN' l) 

jnc 

DATA SET 6 (line 6 ) Van- 
able ler.pth 

IRC 


ONE SAMPLE SEGMENT 


IRC I 

DATA SET 117 ({me 1 17) 
‘EOF - - ; 

HEADER (DATA RUN Z) 

tOP . . 

HEADER 
EOF ! 

EOF i | - 

EOF 1 < 


A and B are retrieved from the image 
data base- 

A= LSIMAGHD , master header 

B» lsimvhdr. lsimchak. lslmlakd, 

imagery data 


RECORD COUNTfP kbytes 

ANCILLARY E-OC*' 70 bvtes 
LINE i t BANG I 19S bytes 

BAND 2 196 bytes 

BAND 3 19S bytes 

B ANjl_4 12s JyftEi 


•data set for 1 acquisition. 
4 CHANNELS . j 

900 BYTES/RECORD i 


\ 

TIgure 3-15 • FFC Unload Tape (sheet 1 of 2} 


i 
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(b 3 ) 


pFCORP COUNTER,. 

2 bytes 

ANCILLARY BLOCK 

“? byt«?s 

LINE 1. BAND i 

19o oytes 

BAND 2 

196 bytes 

BAND 3 

196 bytes 

BAND 4 

196 bytes 

BAND 5 

. 196 bytes 

BAND 6 

196 bytes 

BAND 7 

196 bytes 

BAND 8 

196 bytes 

BAND 9 

196 bytes 

BAND 10 

196 bytes 

BAND 11 

196 bytes 

BAND 12 

196 bvtes 

ZERO FILL 

96 bytes 


DATA SET FOR THREE ACQUISITION, 
12 CHANNELS 

2520 BYTES/RECORD 


—Note: For a 16-channcl data set, two (3^) data sets will be required therefore 

requiring two physical records. 


Figure 3-15. PFC Unload Tape (Sheet 2} 
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